Why do most educators and psychologists strongly believe that grade-equivalent scores should not be used?
Explanation
Grade-equivalent scores are particularly problematic because they sound intuitive but are highly misleading. A seventh-grader scoring at a '10th-grade equivalent' has mastered seventh-grade material exceptionally well, not that they are ready for tenth-grade curriculum.
Other questions
Which type of assessment is described as occurring before or during instruction to guide the teacher in planning and to help students improve their learning?
In which type of test interpretation is a student's score compared to a fixed standard or a minimum passing score?
For commercially produced standardized tests, what reliability coefficient is generally considered 'not very good'?
What is the most important consideration when judging a test, which refers to whether the decisions and inferences based on the test scores are supported by evidence?
What form of assessment bias occurs when qualities of an assessment instrument offend or unfairly penalize a group of students based on their gender, race, socioeconomic status, or religion?
Multiple-choice questions, matching exercises, and true/false statements are all types of which kind of testing?
In a multiple-choice question, what is the term for the wrong answer choices that are designed to distract students who have only partial understanding?
According to the text, for efficiency, essay tests should be limited to the assessment of what type of learning outcomes?
What type of assessment asks students to apply skills and abilities as they would in real life, for example, by using fractions to enlarge a recipe?
What is the term for a systematic collection of student work that often includes works in progress, revisions, and self-analyses to demonstrate learning?
What are the rules used to determine the quality of a student performance, often on a 4-point scale from 'excellent' to 'inadequate'?
What common type of norm-referenced grading arbitrarily limits the number of good grades that can be given and is based on the student's standing in comparison with others?
According to Margaret Clifford's research on the effects of failure, what should educators replace easy success with to encourage learning?
What is the term for the arithmetical average of a group of scores, which is a measure of central tendency?
What is the statistical measure that indicates how widely scores in a distribution vary from the mean?
In a normal distribution, what percentage of all scores is located in the area from 1 standard deviation below the mean to 1 standard deviation above the mean?
What does a percentile rank of 75 on a standardized test indicate?
If a student gets a raw score of 78 on a test where the mean was 70 and the standard deviation was 4, what is the student's z score?
What type of standard score has a mean of 50 and a standard deviation of 10, designed to eliminate negative numbers and decimals found in z-scores?
Stanine scores are standard scores that range from 1 to 9. What are the mean and standard deviation of the stanine scale?
In a 2012 survey of the nation's top teachers, what percentage felt that, on balance, standardized tests do more harm than good?
Which of the following is described in Table 15.5 as an inappropriate use of high-stakes test results?
What is the term for measures that use statistical procedures to assess actual student growth compared to a baseline of expected average growth, based on data from previous years?
What is the name of the consortium of states developing a common set of K–12 assessments in English and math anchored in what it takes to be ready for college and careers?
Which term is broader and includes all kinds of ways to sample and observe students’ skills, knowledge, and abilities, including unit tests, informal observations, and portfolios?
What type of reliability is calculated by comparing performance on half of the test questions with performance on the other half, for instance, by comparing scores on odd- versus even-numbered items?
What is the term for the range of scores that might include a student's true score, which is reported on standardized tests to account for the standard error of measurement?
What type of fairness issue occurs when some groups of students may not have had an equal opportunity to show what they know on a test, such as when the language of the test differs from the students' languages?
In the example provided by Popham for a fourth-grade test item, students were asked to identify the sentence where the word 'field' means the same as in 'My uncle’s field is computer programming.' Why is this item potentially biased?
According to the guidelines for writing objective test items, what should a teacher do if they must use negative words such as 'not' or 'except' in the stem of a multiple-choice question?
What is a recommended final check on the fairness of grading for essay tests?
Any form of assessment that requires students to carry out an activity or produce a product to demonstrate learning is known as what?
What type of performance test is described as having the additional features of being public and often serving as the culminating experience of a whole program of study?
Journals, student observations, checklists, and student self-assessment are all examples of what type of ungraded, formative assessment?
One study in North Carolina found that kindergarten retention had more than doubled from 1992 to 2002. What percentage of students were retained in 2002?
Which of the following is listed as a method for communicating with and reporting to families, beyond just sending home grades?
Which measure of central tendency is the score that occurs most often in a group of scores?
What is one caution for interpreting percentile scores mentioned in the text?
What is one documented problem with high-stakes testing, where the content of the exam eventually defines what is taught in the classroom?
If a teacher uses an ungraded pretest to determine what students already know before beginning a new unit, this is an example of what type of assessment?
What is the term for the large sample of students serving as a comparison group for scoring standardized tests?
What does the text advise about using the choices 'all of the above' and 'none of the above' in multiple-choice tests?
According to James Popham, what is a key characteristic of a good skill-focused rubric?
What is one stated advantage of criterion-referenced grading systems?
If a student scores 700 on an SAT test, where the mean is about 500 and the standard deviation is about 100, what percentage of test-takers did that student outperform?
To be considered valuable and used well, a high-stakes testing program must match the content standards of the district, provide alternate assessment strategies for students with disabilities, and do what else?
What is the relationship between test reliability and validity?
According to the section on involving students in assessments, which of the following is a suggested activity to develop students' sense of efficacy for learning?
Studies have found that in some states, about what percentage of instructional time in elementary schools is spent preparing for end-of-grade tests?