"...there seems to be no escape from the conclusions that the two types of exams are measuring identical things" (Paterson, 1926, p. 246). This conclusion should not be surprising; after all, a well written essay item requires that the student (1) have a store of knowledge, (2) be able to relate facts and principles, and (3) be able to organize such information into a coherent and logical written expression, whereas an objective test item requires that the student (1) have a store of knowledge, (2) be able to relate facts and principles, and (3) be able to organize such information into a coherent and logical choice among several alternatives.
9.
TRUE
Both objective and essay test items are good devices for measuring student achievement. However, as seen in the previous quiz answers, there are particular measurement situations where one item type is more appropriate than the other. Following is a set of recommendations for using either objective or essay test items: (Adapted from Robert L. Ebel, Essentials of Educational Measurement, 1972, p. 144).
1 Sax, G., & Collet, L. S. (1968). An empirical comparison of the effects of recall and multiple-choice tests on student achievement. J ournal of Educational Measurement, 5 (2), 169–173. doi:10.1111/j.1745-3984.1968.tb00622.x
Paterson, D. G. (1926). Do new and old type examinations measure different mental functions? School and Society, 24 , 246–248.
When to Use Essay or Objective Tests
Essay tests are especially appropriate when:
the group to be tested is small and the test is not to be reused.
you wish to encourage and reward the development of student skill in writing.
you are more interested in exploring the student's attitudes than in measuring his/her achievement.
you are more confident of your ability as a critical and fair reader than as an imaginative writer of good objective test items.
Objective tests are especially appropriate when:
the group to be tested is large and the test may be reused.
highly reliable test scores must be obtained as efficiently as possible.
impartiality of evaluation, absolute fairness, and freedom from possible test scoring influences (e.g., fatigue, lack of anonymity) are essential.
you are more confident of your ability to express objective test items clearly than of your ability to judge essay test answers correctly.
there is more pressure for speedy reporting of scores than for speedy test preparation.
Either essay or objective tests can be used to:
measure almost any important educational achievement a written test can measure.
test understanding and ability to apply principles.
test ability to think critically.
test ability to solve problems.
test ability to select relevant facts and principles and to integrate them toward the solution of complex problems.
In addition to the preceding suggestions, it is important to realize that certain item types are better suited than others for measuring particular learning objectives. For example, learning objectives requiring the student to demonstrate or to show , may be better measured by performance test items, whereas objectives requiring the student to explain or to describe may be better measured by essay test items. The matching of learning objective expectations with certain item types can help you select an appropriate kind of test item for your classroom exam as well as provide a higher degree of test validity (i.e., testing what is supposed to be tested). To further illustrate, several sample learning objectives and appropriate test items are provided on the following page.
Learning Objectives
Most Suitable Test Item
The student will be able to categorize and name the parts of the human skeletal system.
Objective Test Item (M-C, T-F, Matching)
The student will be able to critique and appraise another student's English composition on the basis of its organization.
Essay Test Item (Extended-Response)
The student will demonstrate safe laboratory skills.
Performance Test Item
The student will be able to cite four examples of satire that Twain uses in .
Essay Test Item (Short-Answer)
After you have decided to use either an objective, essay or both objective and essay exam, the next step is to select the kind(s) of objective or essay item that you wish to include on the exam. To help you make such a choice, the different kinds of objective and essay items are presented in the following section. The various kinds of items are briefly described and compared to one another in terms of their advantages and limitations for use. Also presented is a set of general suggestions for the construction of each item variation.
II. Suggestions for Using and Writing Test Items
The multiple-choice item consists of two parts: (a) the stem, which identifies the question or problem and (b) the response alternatives. Students are asked to select the one alternative that best completes the statement or answers the question. For example:
Sample Multiple-Choice Item
(a)
(b)
*correct response
Advantages in Using Multiple-Choice Items
Multiple-choice items can provide...
versatility in measuring all levels of cognitive ability.
highly reliable test scores.
scoring efficiency and accuracy.
objective measurement of student achievement or ability.
a wide sampling of content or objectives.
a reduced guessing factor when compared to true-false items.
different response alternatives which can provide diagnostic feedback.
Limitations in Using Multiple-Choice Items
Multiple-choice items...
are difficult and time consuming to construct.
lead an instructor to favor simple recall of facts.
place a high degree of dependence on the student's reading ability and instructor's writing ability.
Suggestions For Writing Multiple-Choice Test Items
1. When possible, state the stem as a direct question rather than as an incomplete statement.
Undesirable:
Desirable:
2. Present a definite, explicit and singular question or problem in the stem.
Undesirable:
Desirable:
3. Eliminate excessive verbiage or irrelevant information from the stem.
Undesirable:
Desirable:
4. Include in the stem any word(s) that might otherwise be repeated in each alternative.
Undesirable:
5. Use negatively stated stems sparingly. When used, underline and/or capitalize the negative word.
Undesirable:
Desirable:
Item Alternatives
6. Make all alternatives plausible and attractive to the less knowledgeable or skillful student.
Undesirable
Desirable
7. Make the alternatives grammatically parallel with each other, and consistent with the stem.
Undesirable:
8. Make the alternatives mutually exclusive.
Undesirable:
The daily minimum required amount of milk that a 10 year old child should drink is
9. When possible, present alternatives in some logical order (e.g., chronological, most to least, alphabetical).
Undesirable
Desirable
10. Be sure there is only one correct or best response to the item.
Undesirable:
11. Make alternatives approximately equal in length.
Undesirable:
12. Avoid irrelevant clues such as grammatical structure, well known verbal associations or connections between stem and answer.
Undesirable: (grammatical clue)
of water behind the dam.
13. Use at least four alternatives for each item to lower the probability of getting the item correct by guessing.
14. Randomly distribute the correct response among the alternative positions throughout the test having approximately the same proportion of alternatives a, b, c, d and e as the correct response.
15. Use the alternatives "none of the above" and "all of the above" sparingly. When used, such alternatives should occasionally be used as the correct response.
A true-false item can be written in one of three forms: simple, complex, or compound. Answers can consist of only two choices (simple), more than two choices (complex), or two choices plus a conditional completion response (compound). An example of each type of true-false item follows:
Sample True-False Item: Simple
The acquisition of morality is a developmental process.
True
False
Sample True-False Item: Complex
Sample true-false item: compound.
The acquisition of morality is a developmental process.
True
False
Advantages In Using True-False Items
True-False items can provide...
the widest sampling of content or objectives per unit of testing time.
an objective measurement of student achievement or ability.
Limitations In Using True-False Items
True-false items...
incorporate an extremely high guessing factor. For simple true-false items, each student has a 50/50 chance of correctly answering the item without any knowledge of the item's content.
can often lead an instructor to write ambiguous statements due to the difficulty of writing statements which are unequivocally true or false.
do not discriminate between students of varying ability as well as other item types.
can often include more irrelevant clues than do other item types.
can often lead an instructor to favor testing of trivial knowledge.
Suggestions For Writing True-False Test Items
1. Base true-false items upon statements that are absolutely true or false, without qualifications or exceptions.
Undesirable:
Desirable:
2. Express the item statement as simply and as clearly as possible.
Undesirable:
Desirable:
3. Express a single idea in each test item.
Undesirable:
Desirable:
4. Include enough background information and qualifications so that the ability to respond correctly to the item does not depend on some special, uncommon knowledge.
Undesirable:
Desirable:
5. Avoid lifting statements from the text, lecture or other materials so that memory alone will not permit a correct answer.
Undesirable:
Desirable:
6. Avoid using negatively stated item statements.
Undesirable:
Desirable:
7. Avoid the use of unfamiliar vocabulary.
Undesirable:
Desirable:
8. Avoid the use of specific determiners which would permit a test-wise but unprepared examinee to respond correctly. Specific determiners refer to sweeping terms like "all," "always," "none," "never," "impossible," "inevitable," etc. Statements including such terms are likely to be false. On the other hand, statements using qualifying determiners such as "usually," "sometimes," "often," etc., are likely to be true. When statements do require the use of specific determiners, make sure they appear in both true and false items.
Undesirable:
required to rule on the constitutionality of a law. (T)
easier to score than an essay test. (T)
Desirable:
180°. (T)
other molecule of that compound. (T)
used for the metering of electrical energy used in a home. (F)
9. False items tend to discriminate more highly than true items. Therefore, use more false items than true items (but no more than 15% additional false items).
In general, matching items consist of a column of stimuli presented on the left side of the exam page and a column of responses placed on the right side of the page. Students are required to match the response associated with a given stimulus. For example:
Sample Matching Test Item
Advantages In Using Matching Items
Matching items...
require short periods of reading and response time, allowing you to cover more content.
provide objective measurement of student achievement or ability.
provide highly reliable test scores.
provide scoring efficiency and accuracy.
Limitations in Using Matching Items
have difficulty measuring learning objectives requiring more than simple recall of information.
are difficult to construct due to the problem of selecting a common set of stimuli and responses.
Suggestions for Writing Matching Test Items
1. Include directions which clearly state the basis for matching the stimuli with the responses. Explain whether or not a response can be used more than once and indicate where to write the answer.
Undesirable:
Desirable:
2. Use only homogeneous material in matching items.
Undesirable:
1.
2.
3.
4.
5.
a.
b.
c.
d. O
e.
f.
Desirable:
1.
2.
3.
4.
a. SO
b.
c.
d. O
e. HCl
3. Arrange the list of responses in some systematic order if possible (e.g., chronological, alphabetical).
Undesirable
Desirable
1.
2.
3.
4.
a.
b.
c.
d.
e.
a.
b.
c.
d.
e.
4. Avoid grammatical or other clues to the correct response.
Undesirable:
1.
2.
3.
4.
Desirable:
5. Keep matching items brief, limiting the list of stimuli to under 10.
6. Include more responses than stimuli to help prevent answering through the process of elimination.
7. When possible, reduce the amount of reading time by including only short phrases or single words in the response list.
The completion item requires the student to answer a question or to finish an incomplete statement by filling in a blank with the correct word or phrase. For example,
Sample Completion Item
According to Freud, personality is made up of three major systems, the _________, the ________ and the ________.
Advantages in Using Completion Items
Completion items...
can provide a wide sampling of content.
can efficiently measure lower levels of cognitive ability.
can minimize guessing as compared to multiple-choice or true-false items.
can usually provide an objective measure of student achievement or ability.
Limitations of Using Completion Items
are difficult to construct so that the desired response is clearly indicated.
are more time consuming to score when compared to multiple-choice or true-false items.
are more difficult to score since more than one answer may have to be considered correct if the item was not properly prepared.
Suggestions for Writing Completion Test Items
1. Omit only significant words from the statement.
Undesirable:
called a nucleus.
Desirable:
.
2. Do not omit so many words from the statement that the intended meaning is lost.
Undesirable:
Desirable:
3. Avoid grammatical or other clues to the correct response.
Undesirable:
decimal system.
Desirable:
4. Be sure there is only one correct response.
Undesirable:
.
Desirable:
.
5. Make the blanks of equal length.
Undesirable:
and (Juno) .
Desirable:
and (Juno) .
6. When possible, delete words at the end of the statement after the student has been presented a clearly defined problem.
Undesirable:
.
Desirable:
is (122.5) .
7. Avoid lifting statements directly from the text, lecture or other sources.
8. Limit the required response to a single word or phrase.
The essay test is probably the most popular of all types of teacher-made tests. In general, a classroom essay test consists of a small number of questions to which the student is expected to demonstrate his/her ability to (a) recall factual knowledge, (b) organize this knowledge and (c) present the knowledge in a logical, integrated answer to the question. An essay test item can be classified as either an extended-response essay item or a short-answer essay item. The latter calls for a more restricted or limited answer in terms of form or scope. An example of each type of essay item follows.
Sample Extended-Response Essay Item
Explain the difference between the S-R (Stimulus-Response) and the S-O-R (Stimulus-Organism-Response) theories of personality. Include in your answer (a) brief descriptions of both theories, (b) supporters of both theories and (c) research methods used to study each of the two theories. (10 pts. 20 minutes)
Sample Short-Answer Essay Item
Identify research methods used to study the S-R (Stimulus-Response) and S-O-R (Stimulus-Organism-Response) theories of personality. (5 pts. 10 minutes)
Advantages In Using Essay Items
Essay items...
are easier and less time consuming to construct than are most other item types.
provide a means for testing student's ability to compose an answer and present it in a logical manner.
can efficiently measure higher order cognitive objectives (e.g., analysis, synthesis, evaluation).
Limitations In Using Essay Items
cannot measure a large amount of content or objectives.
generally provide low test and test scorer reliability.
require an extensive amount of instructor's time to read and grade.
generally do not provide an objective measure of student achievement or ability (subject to bias on the part of the grader).
Suggestions for Writing Essay Test Items
1. Prepare essay items that elicit the type of behavior you want to measure.
Learning Objective:
The student will be able to explain how the normal curve serves as a statistical model.
Undesirable:
Describe a normal curve in terms of: symmetry, modality, kurtosis and skewness.
Desirable:
Briefly explain how the normal curve serves as a statistical model for estimation and hypothesis testing.
2. Phrase each item so that the student's task is clearly indicated.
Undesirable:
Discuss the economic factors which led to the stock market crash of 1929.
Desirable:
Identify the three major economic conditions which led to the stock market crash of 1929. Discuss briefly each condition in correct chronological sequence and in one paragraph indicate how the three factors were inter-related.
3. Indicate for each item a point value or weight and an estimated time limit for answering.
Undesirable:
Compare the writings of Bret Harte and Mark Twain in terms of settings, depth of characterization, and dialogue styles of their main characters.
Desirable:
Compare the writings of Bret Harte and Mark Twain in terms of settings, depth of characterization, and dialogue styles of their main characters. (10 points 20 minutes)
4. Ask questions that will elicit responses on which experts could agree that one answer is better than another.
5. Avoid giving the student a choice among optional items as this greatly reduces the reliability of the test.
6. It is generally recommended for classroom examinations to administer several short-answer items rather than only one or two extended-response items.
Suggestions for Scoring Essay Items
ANALYTICAL SCORING:
Each answer is compared to an ideal answer and points are assigned for the inclusion of necessary elements. Grades are based on the number of accumulated points either absolutely (i.e., A=10 or more points, B=6-9 pts., etc.) or relatively (A=top 15% scores, B=next 30% of scores, etc.)
GLOBAL QUALITY:
Each answer is read and assigned a score (e.g., grade, total points) based either on the total quality of the response or on the total quality of the response relative to other student answers.
Examples Essay Item and Grading Models
"Americans are a mixed-up people with no sense of ethical values. Everyone knows that baseball is far less necessary than food and steel, yet they pay ball players a lot more than farmers and steelworkers."
WHY? Use 3-4 sentences to indicate how an economist would explain the above situation.
Analytical Scoring
Global Quality
Assign scores or grades on the overall quality of the written response as compared to an ideal answer. Or, compare the overall quality of a response to other student responses by sorting the papers into three stacks:
Read and sort each stack again divide into three more stacks
In total, nine discriminations can be used to assign test grades in this manner. The number of stacks or discriminations can vary to meet your needs.
Try not to allow factors which are irrelevant to the learning outcomes being measured affect your grading (i.e., handwriting, spelling, neatness).
Read and grade all class answers to one item before going on to the next item.
Read and grade the answers without looking at the students' names to avoid possible preferential treatment.
Occasionally shuffle papers during the reading of answers to help avoid any systematic order effects (i.e., Sally's "B" work always followed Jim's "A" work thus it looked more like "C" work).
When possible, ask another instructor to read and grade your students' responses.
Another form of a subjective test item is the problem solving or computational exam question. Such items present the student with a problem situation or task and require a demonstration of work procedures and a correct solution, or just a correct solution. This kind of test item is classified as a subjective type of item due to the procedures used to score item responses. Instructors can assign full or partial credit to either correct or incorrect solutions depending on the quality and kind of work procedures presented. An example of a problem solving test item follows.
Example Problem Solving Test Item
It was calculated that 75 men could complete a strip on a new highway in 70 days. When work was scheduled to commence, it was found necessary to send 25 men on another road project. How many days longer will it take to complete the strip? Show your work for full or partial credit.
Advantages In Using Problem Solving Items
Problem solving items...
minimize guessing by requiring the students to provide an original response rather than to select from several alternatives.
are easier to construct than are multiple-choice or matching items.
can most appropriately measure learning objectives which focus on the ability to apply skills or knowledge in the solution of problems.
can measure an extensive amount of content or objectives.
Limitations in Using Problem Solving Items
require an extensive amount of instructor time to read and grade.
generally do not provide an objective measure of student achievement or ability (subject to bias on the part of the grader when partial credit is given).
Suggestions For Writing Problem Solving Test Items
1. Clearly identify and explain the problem.
Undesirable:
Desirable:
2. Provide directions which clearly inform the student of the type of response called for.
Undesirable:
Desirable:
3. State in the directions whether or not the student must show his/her work procedures for full or partial credit.
Undesirable:
Desirable:
4. Clearly separate item parts and indicate their point values.
A man leaves his home and drives to a convention at an average rate of 50 miles per hour. Upon arrival, he finds a telegram advising him to return at once. He catches a plane that takes him back at an average rate of 300 miles per hour.
Undesirable:
Desirable:
5. Use figures, conditions and situations which create a realistic problem.
Undesirable:
Desirable:
6. Ask questions that elicit responses on which experts could agree that one solution and one or more work procedures are better than others.
7. Work through each problem before classroom administration to double-check accuracy.
A performance test item is designed to assess the ability of a student to perform correctly in a simulated situation (i.e., a situation in which the student will be ultimately expected to apply his/her learning). The concept of simulation is central in performance testing; a performance test will simulate to some degree a real life situation to accomplish the assessment. In theory, a performance test could be constructed for any skill and real life situation. In practice, most performance tests have been developed for the assessment of vocational, managerial, administrative, leadership, communication, interpersonal and physical education skills in various simulated situations. An illustrative example of a performance test item is provided below.
Sample Performance Test Item
Assume that some of the instructional objectives of an urban planning course include the development of the student's ability to effectively use the principles covered in the course in various "real life" situations common for an urban planning professional. A performance test item could measure this development by presenting the student with a specific situation which represents a "real life" situation. For example,
An urban planning board makes a last minute request for the professional to act as consultant and critique a written proposal which is to be considered in a board meeting that very evening. The professional arrives before the meeting and has one hour to analyze the written proposal and prepare his critique. The critique presentation is then made verbally during the board meeting; reactions of members of the board or the audience include requests for explanation of specific points or informed attacks on the positions taken by the professional.
The performance test designed to simulate this situation would require that the student to be tested role play the professional's part, while students or faculty act the other roles in the situation. Various aspects of the "professional's" performance would then be observed and rated by several judges with the necessary background. The ratings could then be used both to provide the student with a diagnosis of his/her strengths and weaknesses and to contribute to an overall summary evaluation of the student's abilities.
Advantages In Using Performance Test Items
Performance test items...
can most appropriately measure learning objectives which focus on the ability of the students to apply skills or knowledge in real life situations.
usually provide a degree of test validity not possible with standard paper and pencil test items.
are useful for measuring learning objectives in the psychomotor domain.
Limitations In Using Performance Test Items
are difficult and time consuming to construct.
are primarily used for testing students individually and not for testing groups. Consequently, they are relatively costly, time consuming, and inconvenient forms of testing.
generally do not provide an objective measure of student achievement or ability (subject to bias on the part of the observer/grader).
Suggestions For Writing Performance Test Items
Prepare items that elicit the type of behavior you want to measure.
Clearly identify and explain the simulated situation to the student.
Make the simulated situation as "life-like" as possible.
Provide directions which clearly inform the students of the type of response called for.
When appropriate, clearly state time and activity limitations in the directions.
Adequately train the observer(s)/scorer(s) to ensure that they are fair in scoring the appropriate behaviors.
III. TWO METHODS FOR ASSESSING TEST ITEM QUALITY
This section presents two methods for collecting feedback on the quality of your test items. The two methods include using self-review checklists and student evaluation of test item quality. You can use the information gathered from either method to identify strengths and weaknesses in your item writing.
Checklist for Evaluating Test Items
EVALUATE YOUR TEST ITEMS BY CHECKING THE SUGGESTIONS WHICH YOU FEEL YOU HAVE FOLLOWED.
____
When possible, stated the stem as a direct question rather than as an incomplete statement.
____
Presented a definite, explicit and singular question or problem in the stem.
____
Eliminated excessive verbiage or irrelevant information from the stem.
____
Included in the stem any word(s) that might have otherwise been repeated in each alternative.
____
Used negatively stated stems sparingly. When used, underlined and/or capitalized the negative word(s).
____
Made all alternatives plausible and attractive to the less knowledgeable or skillful student.
____
Made the alternatives grammatically parallel with each other, and consistent with the stem.
____
Made the alternatives mutually exclusive.
____
When possible, presented alternatives in some logical order (e.g., chronologically, most to least).
____
Made sure there was only one correct or best response per item.
____
Made alternatives approximately equal in length.
____
Avoided irrelevant clues such as grammatical structure, well known verbal associations or connections between stem and answer.
____
Used at least four alternatives for each item.
____
Randomly distributed the correct response among the alternative positions throughout the test having approximately the same proportion of alternatives a, b, c, d, and e as the correct response.
____
Used the alternatives "none of the above" and "all of the above" sparingly. When used, such alternatives were occasionally the correct response.
____
Based true-false items upon statements that are absolutely true or false, without qualifications or exceptions.
____
Expressed the item statement as simply and as clearly as possible.
____
Expressed a single idea in each test item.
____
Included enough background information and qualifications so that the ability to respond correctly did not depend on some special, uncommon knowledge.
____
Avoided lifting statements from the text, lecture, or other materials.
____
Avoided using negatively stated item statements.
____
Avoided the use of unfamiliar language.
____
Avoided the use of specific determiners such as "all," "always," "none," "never," etc., and qualifying determiners such as "usually," "sometimes," "often," etc.
____
Used more false items than true items (but not more than 15% additional false items).
____
Included directions which clearly stated the basis for matching the stimuli with the response.
____
Explained whether or not a response could be used more than once and indicated where to write the answer.
____
Used only homogeneous material.
____
When possible, arranged the list of responses in some systematic order (e.g., chronologically, alphabetically).
____
Avoided grammatical or other clues to the correct response.
____
Kept items brief (limited the list of stimuli to under 10).
____
Included more responses than stimuli.
____
When possible, reduced the amount of reading time by including only short phrases or single words in the response list.
____
Omitted only significant words from the statement.
____
Did not omit so many words from the statement that the intended meaning was lost.
____
Avoided grammatical or other clues to the correct response.
____
Included only one correct response per item.
____
Made the blanks of equal length.
____
When possible, deleted the words at the end of the statement after the student was presented with a clearly defined problem.
____
Avoided lifting statements directly from the text, lecture, or other sources.
____
Limited the required response to a single word or phrase.
____
Prepared items that elicited the type of behavior you wanted to measure.
____
Phrased each item so that the student's task was clearly indicated.
____
Indicated for each item a point value or weight and an estimated time limit for answering.
____
Asked questions that elicited responses on which experts could agree that one answer is better than others.
____
Avoided giving the student a choice among optional items.
____
Administered several short-answer items rather than 1 or 2 extended-response items.
Grading Essay Test Items
____
Selected an appropriate grading model.
____
Tried not to allow factors which were irrelevant to the learning outcomes being measured to affect your grading (e.g., handwriting, spelling, neatness).
____
Read and graded all class answers to one item before going on to the next item.
____
Read and graded the answers without looking at the student's name to avoid possible preferential treatment.
____
Occasionally shuffled papers during the reading of answers.
____
When possible, asked another instructor to read and grade your students' responses.
____
Clearly identified and explained the problem to the student.
____
Provided directions which clearly informed the student of the type of response called for.
____
Stated in the directions whether or not the student must show work procedures for full or partial credit.
____
Clearly separated item parts and indicated their point values.
____
Used figures, conditions and situations which created a realistic problem.
____
Asked questions that elicited responses on which experts could agree that one solution and one or more work procedures are better than others.
____
Worked through each problem before classroom administration.
____
Prepared items that elicit the type of behavior you wanted to measure.
____
Clearly identified and explained the simulated situation to the student.
____
Made the simulated situation as "life-like" as possible.
____
Provided directions which clearly inform the students of the type of response called for.
____
When appropriate, clearly stated time and activity limitations in the directions.
____
Adequately trained the observer(s)/scorer(s) to ensure that they were fair in scoring the appropriate behaviors.
STUDENT EVALUATION OF TEST ITEM QUALITY
Using ices questionnaire items to assess your test item quality .
The following set of ICES (Instructor and Course Evaluation System) questionnaire items can be used to assess the quality of your test items. The items are presented with their original ICES catalogue number. You are encouraged to include one or more of the items on the ICES evaluation form in order to collect student opinion of your item writing quality.
102--How would you rate the instructor's examination questions?
116--Did the exams challenge you to do original thinking?
Excellent
Poor
Yes, very challenging
No, not challenging
103--How well did examination questions reflect content and emphasis of the course?
118--Were there "trick" or trite questions on tests?
Well related
Poorly related
Lots of them
Few if any
114--The exams reflected important points in the reading assignments.
122--How difficult were the examinations?
Strongly agree
Strongly disagree
Too difficult
Too easy
119--Were exam questions worded clearly?
123--I found I could score reasonably well on exams by just cramming.
Yes, very clear
No, very unclear
Strongly agree
Strongly disagree
115--Were the instructor's test questions thought provoking?
121--How was the length of exams for the time allotted.
Definitely yes
Definitely no
Too long
Too short
125--Were exams adequately discussed upon return?
109--Were exams, papers, reports returned with errors explained or personal comments?
Yes, adequately
No, not enough
Almost always
Almost never
IV. ASSISTANCE OFFERED BY THE CENTER FOR INNOVATION IN TEACHING AND LEARNING (CITL)
The information on this page is intended for self-instruction. However, CITL staff members will consult with faculty who wish to analyze and improve their test item writing. The staff can also consult with faculty about other instructional problems. Instructors wishing to acquire CITL assistance can contact [email protected] .
V. REFERENCES FOR FURTHER READING
Ebel, R. L. (1965). Measuring educational achievement . Prentice-Hall. Ebel, R. L. (1972). Essentials of educational measurement . Prentice-Hall. Gronlund, N. E. (1976). Measurement and evaluation in teaching (3rd ed.). Macmillan. Mehrens W. A. & Lehmann I. J. (1973). Measurement and evaluation in education and psychology . Holt, Rinehart & Winston. Nelson, C. H. (1970). Measurement and evaluation in the classroom . Macmillan. Payne, D. A. (1974). The assessment of learning: Cognitive and affective . D.C. Heath & Co. Scannell, D. P., & Tracy D. B. (1975). Testing and measurement in the classroom . Houghton Mifflin. Thorndike, R. L. (1971). Educational measurement (2nd ed.). American Council on Education.
Center for Innovation in Teaching & Learning
249 Armory Building 505 East Armory Avenue Champaign, IL 61820
Brame, C. (2013) Writing good multiple choice test questions. Retrieved [todaysdate] from https://cft.vanderbilt.edu/guides-sub-pages/writing-good-multiple-choice-test-questions/.
Constructing an Effective Stem
Constructing effective alternatives.
Additional Guidelines for Multiple Choice Questions
Considerations for Writing Multiple Choice Items that Test Higher-order Thinking
Additional resources.
Multiple choice test questions, also known as items, can be an effective and efficient way to assess learning outcomes. Multiple choice test items have several potential advantages:
Reliability: Reliability is defined as the degree to which a test consistently measures a learning outcome. Multiple choice test items are less susceptible to guessing than true/false questions, making them a more reliable means of assessment. The reliability is enhanced when the number of MC items focused on a single learning objective is increased. In addition, the objective scoring associated with multiple choice test items frees them from problems with scorer inconsistency that can plague scoring of essay questions.
Validity: Validity is the degree to which a test measures the learning outcomes it purports to measure. Because students can typically answer a multiple choice item much more quickly than an essay question, tests based on multiple choice items can typically focus on a relatively broad representation of course material, thus increasing the validity of the assessment.
The key to taking advantage of these strengths, however, is construction of good multiple choice items.
A multiple choice item consists of a problem, known as the stem, and a list of suggested solutions, known as alternatives. The alternatives consist of one correct or best alternative, which is the answer, and incorrect or inferior alternatives, known as distractors.
1. The stem should be meaningful by itself and should present a definite problem. A stem that presents a definite problem allows a focus on the learning outcome. A stem that does not present a clear problem, however, may test students’ ability to draw inferences from vague descriptions rather serving as a more direct test of students’ achievement of the learning outcome.
2. The stem should not contain irrelevant material , which can decrease the reliability and the validity of the test scores (Haldyna and Downing 1989).
3. The stem should be negatively stated only when significant learning outcomes require it. Students often have difficulty understanding items with negative phrasing (Rodriguez 1997). If a significant learning outcome requires negative phrasing, such as identification of dangerous laboratory or clinical practices, the negative element should be emphasized with italics or capitalization.
4. The stem should be a question or a partial sentence. A question stem is preferable because it allows the student to focus on answering the question rather than holding the partial sentence in working memory and sequentially completing it with each alternative (Statman 1988). The cognitive load is increased when the stem is constructed with an initial or interior blank, so this construction should be avoided.
1. All alternatives should be plausible. The function of the incorrect alternatives is to serve as distractors,which should be selected by students who did not achieve the learning outcome but ignored by students who did achieve the learning outcome. Alternatives that are implausible don’t serve as functional distractors and thus should not be used. Common student errors provide the best source of distractors.
2. Alternatives should be stated clearly and concisely. Items that are excessively wordy assess students’ reading ability rather than their attainment of the learning objective
3. Alternatives should be mutually exclusive. Alternatives with overlapping content may be considered “trick” items by test-takers, excessive use of which can erode trust and respect for the testing process.
4. Alternatives should be homogenous in content. Alternatives that are heterogeneous in content can provide cues to student about the correct answer.
5. Alternatives should be free from clues about which response is correct. Sophisticated test-takers are alert to inadvertent clues to the correct answer, such differences in grammar, length, formatting, and language choice in the alternatives. It’s therefore important that alternatives
have grammar consistent with the stem.
are parallel in form.
are similar in length.
use similar language (e.g., all unlike textbook language or all like textbook language).
6. The alternatives “all of the above” and “none of the above” should not be used. When “all of the above” is used as an answer, test-takers who can identify more than one alternative as correct can select the correct answer even if unsure about other alternative(s). When “none of the above” is used as an alternative, test-takers who can eliminate a single option can thereby eliminate a second option. In either case, students can use partial knowledge to arrive at a correct answer.
7. The alternatives should be presented in a logical order (e.g., alphabetical or numerical) to avoid a bias toward certain positions.
8. The number of alternatives can vary among items as long as all alternatives are plausible. Plausible alternatives serve as functional distractors, which are those chosen by students that have not achieved the objective but ignored by students that have achieved the objective. There is little difference in difficulty, discrimination, and test score reliability among items containing two, three, and four distractors.
Additional Guidelines
1. Avoid complex multiple choice items , in which some or all of the alternatives consist of different combinations of options. As with “all of the above” answers, a sophisticated test-taker can use partial knowledge to achieve a correct answer.
2. Keep the specific content of items independent of one another. Savvy test-takers can use information in one question to answer another question, reducing the validity of the test.
When writing multiple choice items to test higher-order thinking, design questions that focus on higher levels of cognition as defined by Bloom’s taxonomy . A stem that presents a problem that requires application of course principles, analysis of a problem, or evaluation of alternatives is focused on higher-order thinking and thus tests students’ ability to do such thinking. In constructing multiple choice items to test higher order thinking, it can also be helpful to design problems that require multilogical thinking, where multilogical thinking is defined as “thinking that requires knowledge of more than one fact to logically and systematically apply concepts to a …problem” (Morrison and Free, 2001, page 20). Finally, designing alternatives that require a high level of discrimination can also contribute to multiple choice items that test higher-order thinking.
Burton, Steven J., Sudweeks, Richard R., Merrill, Paul F., and Wood, Bud. How to Prepare Better Multiple Choice Test Items: Guidelines for University Faculty, 1991.
Cheung, Derek and Bucat, Robert. How can we construct good multiple-choice items? Presented at the Science and Technology Education Conference, Hong Kong, June 20-21, 2002.
Haladyna, Thomas M. Developing and validating multiple-choice test items, 2 nd edition. Lawrence Erlbaum Associates, 1999.
Haladyna, Thomas M. and Downing, S. M.. Validity of a taxonomy of multiple-choice item-writing rules. Applied Measurement in Education , 2(1), 51-78, 1989.
Morrison, Susan and Free, Kathleen. Writing multiple-choice test items that promote and measure critical thinking. Journal of Nursing Education 40: 17-24, 2001.
Teaching Guides
Quick Links
Services for Departments and Schools
Examples of Online Instructional Modules
Page Content
Sidebar Content
Main Navigation
Quick links
All TIP Sheets
Taking Lecture Notes
Study Tips for Biology Classes
How to Study for Tests
Working Successfully with a Study Group
Avoiding Test Anxiety
Multiple Choice and Other Objective Tests
Essay Tests
Take-Home and Open-Book Tests
Plan Ahead: Studying for Finals
TIP Sheet MULTIPLE CHOICE AND OTHER OBJECTIVE TESTS
General Statements about Objective Tests
Objective tests require recognition and recall of subject matter.
The forms vary: questions of fact, sentence completion, true-false, analogy, multiple-choice, and matching.
They tend to cover more material than essay tests.
They have one, and only one, correct answer to each question.
They may require strict preparation like memorization.
Before Answering
Listen carefully to oral directions.
Notice if there is a penalty for guessing.
Glance quickly through the entire test.
Observe point values of different sections.
Budget your time.
Read the instructions and follow them.
Write your name on each page of the test.
While Answering
Read all directions carefully.
Read each question carefully.
If allowed to, underline key words.
Answer the easy questions first.
Skip questions that stump you. Mark them to come back later.
If you have time at the end, go back to the questions you marked.
Do not go back over every question. Reread only the ones that you were unsure of.
Do not second-guess yourself. Change an answer only if you are absolutely sure your first answer was wrong. The odds are in your favor that your first answer was right.
Make sure you have answered all the questions.
If you have no idea of the answer, guess!
STRATEGIES FOR TAKING OBJECTIVE EXAMS
Prepare thoroughly for all of your exams. There is no real substitute for studying. Start studying for your final exam the first day of class.
Use a variety of study strategies. Know your preferred learning style and take advantage of it!
Pay no attention to students who finish early. Do not automatically presume that students who finish early did well on the test (they often leave early because they didn't study enough!)
Plan on being the last one to leave. That way you can relax and make the most of your time.
Ignore what other students are saying before and after the exam.
Consider all alternatives in a multiple choice question before making your decision.
Always guess if there is no penalty for guessing.
Do not guess if there is a penalty for guessing and you have no basis on which to make a good choice.
Eliminate options which are known to be incorrect and choose from the remaining options.
Look for information in test items that will help you answer other questions.
Pay close attention to key words on True-False Tests.
a. Closed words (such as never , only , always , all , none , and most ) are often (but not always) indicators of a false statement because they restrict possibilities.
b. Open words (such as usually , frequently , mostly , may , and generally ) are often (but not always) found in true statements.
STEPS TO REMEMBER
To help you score as high as possible on all exams we have devised a plan of attack called SCORER. Each letter in the word stands for an important rule in test-taking. SCORER is based on the experience of many teachers and students and on research findings -- it might work for you!
S - Schedule your time.
C - Clue words help.
O - Omit the difficult questions.
R - Read carefully.
E - Estimate your answers.
R - Review your work.
S - The first letter in SCORER reminds you to SCHEDULE your time.
Consider the exam as a whole. How long is it? How many sections? How many questions? Are there especially easy or very difficult sections or questions? Estimate roughly the time needed for each section. Schedule your time.
For example, in a 50-minute test containing 20 questions you can spend about 50 divided by 20 or 21 minutes on each question. If you start at 9 AM you should be one-third finished by 9:17 halfway by 9:25 working on question 16 by 9:40. If you lag much behind these times you will run out of time before you finish the test.
C - The second letter in SCORER reminds you to watch for CLUE WORDS.
Almost every question has built-in clues to what is wanted. In a true-false test the Instructor must make up questions that are absolutely true or absolutely false. If he asks: "An unhappy childhood produces a neurotic adult. (True or False?)," he has a question he cannot grade. The more you know about psychology the more difficult this question is to answer. It is sometimes true, sometimes not: true for some people, false for others.
"An unhappy childhood always produces a neurotic adult." Vs. "An unhappy childhood never produces a neurotic adult." Vs. "An unhappy childhood sometimes produces a neurotic adult."
The first two are clearly false and the last is clearly true. The words always, never, and sometimes are called clue words.
"All men are taller than all women." "Some men are taller than women." "Men are never taller than women." "Men are usually taller than women." "Men are sometimes taller than women."
Answers: False, True, False, True, True
The clue words are all, some, never, usually, sometimes. These words are a key to answering objective test questions.
Some clue words such as all, every, none, exactly, always, and never indicate that the statement is absolutely true. Exceptions are not allowed. If they appear in a statement it must be true in every case to be true at all. For example:
"All squares have four equal sides." (That's a definition.)
"Every insect has six legs." (if it has more or less than six it is not an insect.)
"Politicians are invariably dishonest." (That means there has never been an honest politician. We're not certain, but we think this is false.)
Other clue words such as many, most, some, usually, few, or often are qualifiers. They indicate a limited range of truth.
"Some apples are green." (Sure, some apples are also yellow, pink, and even red.)
All clue words are red lights for test takers. When you see one, STOP and learn what it is telling you.
O - The third letter in SCORER reminds you to OMIT the DIFFICULT QUESTIONS.
A test is not the sort of semi-fatal illness you fall into; it is a battle to be planned, fought, and won. You size up the enemy, look at the terrain, check out his artillery, develop your strategy, and attack at the place you have the best chance of success. The 0 rule in SCORER says that to score high on tests you should find the easiest questions and answer them first. Omit or postpone the more difficult ones later.
The procedure for an objective exam is the following:
Move rapidly through the test.
When you find an easy question or one you are certain of, answer it.
Omit the difficult ones on this first pass.
When you skip a question, make a mark in the margin. (Do not use a red pencil or pen. Your marks could get confused with the grader's marks).
Keep moving. Never erase. Don't dawdle. Jot brief notes in the margin for later use if you need to.
When you have finished the easy ones return to those with marks, and try again.
Mark again those answers you are still not sure of.
In your review (that's the last R on SCORER) you will go over all the questions if time permits.
R - The fourth letter of SCORER reminds you to READ CAREFULLY.
As we have already explained, it is very important that you read the directions carefully before you begin. It is also very important that you read each question completely and with care.
Read all of the questions. Many students, because they are careless or rushed for time, read only part of the question and answer it on the basis of that part. For example, consider the statement "Supreme Court decisions are very effective in influencing attitudes." If you disagree with some Supreme Court decisions you may mark it false after reading the first six words. The political scientist knows it is true. He is not asking you whether the Court is doing a good job, only what the effects of its decisions are.
Read the question as it is. Be careful to interpret the question as the instructor intended. Don't let your bias or expectation lure you into a false reading. For example, the statement "Once an American, always an American." may be marked true by a super-patriot who believes it should be true. Legally, it is not true.
Read it logically. If the statement has several parts, all parts must be true if the statement is to be true. The statement, "George Washington was elected president because he was a famous film star." is false. (Not in 1776. Today it might be possible.) The statement, "Chlorine gas is a greenish, poisonous, foul-smelling, very rare gas used in water purification," is false. (It is not rare.)
E - The E in SCORER reminds you to ESTIMATE.
Your instructor may never admit it, but you can go a long way on an objective exam by guessing.
On most true-false or multiple-choice tests, your final score is simply the number you answer correctly. Wrong answers are ignored. There is not a penalty for guessing. On some tests you may have points subtracted from your score for wrong answers. Be certain you know how the test will be scored. If the test directions do not make it perfectly clear, ask your instructor.
If there is no penalty for guessing, be certain you answer every question even if you must guess.
If you have plenty of time, proceed as we have already outlined: omit or postpone the difficult questions, answer the easy ones first, return to the difficult ones later. Guess on any you do not know. (But be careful. Your instructor may be upset if you start flipping a dime and shouting "Heads" and "Tails" during the exam.)
If the test is a long one and you are pressed for time, answer the easy ones, guess at the difficult ones.
If guessing is penalized, then do not guess on true-false questions and make an educated guess on multiple-choice questions only if you can narrow the possibilities down to two. Guess at completion or fill-in questions if you have any idea of what the answer is. Part of a correct answer may earn some credit.
"Guesstimating" is an important part of test-taking.
R - The last letter in SCORER is a reminder to REVIEW your work.
Use every minute that is available to you. Anyone who leaves the exam room early is either very foolish or super-confident. Review everything you have done.
Return to the double-checked, difficult questions. Reread them. Look for clue words. Look for new hints. Then go to the checked questions and finally to the unmarked ones if there is still time.
Don't be too eager to change answers. Change only if you have a good reason for changing.
Be certain you have considered all questions.
It is most important to build up your knowledge and understanding of the subject through systematic study, reading, and class work. SCORER is designed to help you do you best with what you know.
__________________________________________
More on Multiple Choice Tests
Following are additional specific strategies that can be used when taking multiple choice tests:
There are three major reasons that multiple-choice questions appear on many college tests.
They can be used to test all aspects of students, knowledge and their ability to reason with information that they have learned.
If students have difficulty expressing their thoughts in writing, poor writing ability will not lower their grades on multiple-choice tests.
When answers are recorded on answer sheet, multiple choice tests are easy to grade.
Because of these advantages, you will answer many multiple choice questions on the tests you take during your college career.
Stems, Options, and Distractors
Multiple-choice questions are usually either incomplete statements followed by possible ways the statements may be completed or they are questions followed by possible answers. The following question is an incomplete statement followed by possible ways the statement may be completed.
In this country, the ultimate legal responsibility for the education of children belongs to:
a. parents. b. states. c. the federal government. d. local school boards.
The first part of a multiple-choice question is called the stem. The stem of the above example is:
" In this country, the ultimate legal responsibility for the education of children belongs to "
The choices that are given for answers are called options. These are the options in the example:
parents; states; the federal government; local school boards
Options are written so that one is the correct answer and the others are distractors. The correct answer to this question is option b; options a, c, and d are distractors. Correct answers are supposed to be selected by students who know correct answers. Other students are supposed to be distracted and select one of the other options -- one of the distractors.
Eliminate the distractors
The basic strategy for answering a multiple choice question is to eliminate the distractors and to select as the correct answer the option that is not a distractor. One way to locate distractors is to analyze a multiple choice question as though it is a series of true-false questions. The following questions about American history may be analyzed in this way.
Centers for early gold rushes were in the present-day states of:
a. Oklahoma and Texas. b. California and New Mexico. c. Kansas and Nebraska. d. Nevada and Colorado.
This question, like most multiple-choice questions, is actually a series of true-false questions, only one of which is true. All the options are false except d.
When you answer a multiple-choice question, indicate with an X or a check mark the options that you decide are distractors. For example:
Oklahoma and Texas. X
California and New Mexico .
Kansas and Nebraska. X
Nevada and California .
In this example, a student has decided that option a and option c are distractors. She/He will eventually cross out option b and decide that option d is correct, or she will cross out option d and decide that option b is correct. The correct answer is option d.
Use common sense and sound reasoning
You may sometimes be able to select the correct answer to a multiple-choice question by using common sense, sound reasoning, experience you have had, and information you know. For instance, since you have been or have known many male adolescents, you can probably use your experience to answer the following question correctly.
Which of the following is not a secondary sex characteristic of normal male adolescents?
a. Their voices deepen. b. They grow facial hair. c. Their subcutaneous fat increases. d. Their muscles develop noticeably.
Even if you do not know what a secondary sex characteristic is, you do know that options a, b, and d state facts about male adolescents. You might, therefore, conclude that option c does not state a fact about young men. Option c is the correct answer; it describes female adolescents.
Sometimes you may know information that will help you to select a correct answer. For instance, you may know that the word intrinsic refers to "that which is within." If you know the meaning of intrinsic, you should be able to answer the following question correctly.
Which of the following is an example of an intrinsic reward?
a. food b. money c. praise d. self-approval
If you know the meaning of intrinsic , you should select option d as the correct answer. Self-approval is an intrinsic reward – it comes from within a person. Food, money, and praise, on the other hand, are extrinsic rewards – they come from outside a person.
Summary for Multiple Choice Questions
When you answer a multiple-choice question:
1. Cross out the distractors and select as the correct answer the option that is not a distractor.
2. Use common sense, sound reasoning, experiences you have had, and information you know to select correct answers.
When necessary, make your best guess:
Although no specific techniques can be applied to all multiple choice tests, the following are frequently means of getting points out of questions for which you don't really know the answers.
Occasionally, testers overlook some of the faults described below. It is important to use the following techniques with care to determine if they are applicable.
I. AT TIMES THE LONGEST ANSWER IS THE CORRECT ONE. Example:
The results of research on a sample drawn form the 9th grade students who have failed Algebra will:
a. have no specific significance. b. yield important data for all high schools. c. generalize for the narrow population, but may carry implications for similar populations.
The answer is c, mainly because it is the longest and most complete. Usually a test writer makes up a multiple choice test by leafing through the material to be tested. He may come upon a statement that seems to provide a question and answer, and he bases the multiple choice item on this. Test writers in a hurry write as few words as they can get away with. Therefore, they skimp when they are writing incorrect choices on a multiple choice test. The best way to determine length is to compare the number of words used in the answer. The physical length is less important. Usually the choice containing the most words is the right answer.
II. IN A CARELESSLY WRITTEN TEST, ONE OR MORE OF THE POSSIBLE ANSWERS MAY BE ELIMINATED ON GRAMMATICAL GROUNDS. Examples:
Which of the following are the best source of information concerning the interior structure of the earth?
a. barogram b. seismograms c. thermogram d. hygrogram
The question asks for a plural answer. ("Which of the following are....") Only b is a plural answer, so that is the correct one.
Shakespeare's reference to clocks in "Julius Caesar" is an example of an:
a. anachronism b. antiquareanisms c. poetic licence d. ignorance
Grammatical grounds eliminate option c since the question calls for an answer beginning with a vowel " example of an ...." Answer a and b begin with the same syllable, so it is probably one of these two: b is plural, and the question asks for singular answer. The best choice is a.
III. IF TWO CHOICES BEGIN WITH THE SAME SOUND OR CONTAIN DISTINCTIVE SOUNDS OR SPELLING, THE CORRECT ANSWER TENDS TO BE ONE OF THESE TWO CHOICES.
Often a test writer will think it smart to include among the wrong answers a distractor similar to the right answer. This is done to ensure that the student is more than just vaguely familiar with what might be the correct answer.
The functional unit of the kidney is:
a. the pelvis b. the nephron c. the neuron d. the medulla
Options b and c are very similar in spelling, so one of those is probably the answer. After this there are no clues, so that a student must use knowledge or guess. Option b is the correct answer.
The water bearing layer of an artesian formation is most likely composed of:
a. limestone b. sand c. granite d. sandstone
The work "sand" is repeated in b and d, and "stone" occurs in a and d. Answer d has both repeated elements. The best guess could be d.
IV. AVOID ANSWERS THAT REPEAT IMPORTANT WORDS GIVEN IN THE QUESTION.
Many test writers routinely include wrong answers that repeat terms of the question just to distract wild guessers.
An important commercial source of ammonia is:
a. ammonia water b. coal tar c. soft coal d. petroleum
The repetition of " ammonia " in answer a potentially eliminates that as the correct choice.
"Coal" in both b and b suggests one of these answers, and c is the correct one.
Test questions are often taken directly from the textbook. Watch for "unusual" or "catchy" statements. Watch for dates, definitions, or statements of facts.
V. ASK, before you take the test, if you are penalized for guessing. If so, don't guess. The instructor may subtract the number wrong for the number right. Then you may pay twice for every wrong answer.
VI. UNDERSTAND precisely how to indicate the answers. (Do you put your "x" by the right one or the wrong one?)
VII. WATCH your numbers. It's easy to get mixed up.
VIII. WATCH for special words.
Statements with never or always are likely to be false.
Moderate statements are often true.
An answer that is "almost, but not quite true" is still false.
Extreme statements are almost always false.
Read through each question quickly and answer the ones you are fairly sure of first. Spend little time on the questions, and skip the ones you don't know. These can be analyzed when you can come back to them. Remember that these test techniques alone will not help you do well on a test. Your knowledge of the subject matter is the main determinant of how well you will do!
Home | Calendars | Library | Bookstore | Directory | Apply Now | Search for Classes | Register | Online Classes | MyBC Portal MyBC -->
Butte College | 3536 Butte Campus Drive, Oroville CA 95965 | General Information (530) 895-2511
Qorrect Blog
7 Key Characteristics of a Good Test in Education in 10 Minutes
Yasmine Nasr December 15, 2021 136,215 Views
Table of Contents
Sticking to as many characteristics of a good test in education as possible is a challenging process for teachers.
In 10 min or less, you’ll get a brief on all commonly agreed upon characteristics, practical ways to employ them in order to make your test reliable, and top of the world universities that use them!
One of the major goals of education is to prepare students for the next step in their future. They have to make sure that their learners have acquired enough knowledge about the field of study. Only good tests ensure this. A good test is not only a score that learners struggle to ace.
It’s feedback a student receives to improve his skills and knowledge and a good teacher loves to get back to, always, to make sure their teaching strategies are on point and whether they need development or not.
It’s also a feedback for decision-makers in all educational institutions and governmental positions who need good data to get to the next step of the institution or the State’s education plan.
It’s not something centric that students spend days of anxiety on, wondering how well they will do in a given test and how well the test questions are actually written and whether they are questions they do know the answer to or not.
What Are the Characteristics of a Good Test in Education?
What is a good test in education? It is an evaluation through which teachers measure learners’ abilities and points of weaknesses and strengths. It gauges their knowledge in the field of study and provides both sides with real feedback.
A good test should ensure that learners are ready to move to the next step whether this step is a high school, college, or even the military.
In our previous event, the first free online webinar, “ Ensuring Effective E-Assessment for Higher Education ,” Qorrect e-assessment team, discussed the complete cycle of a good test in detail focusing on higher education examination.
The team discussed how to analyze, design, develop, implement, and evaluate the phases that together comprise the e-assessment life cycle , going through the e -assessment life cycle and its importance to higher education, edtech role in the evolution of the digital assessment process .
That’s plus considering the contribution of edtech in improving assessment quality , analysing the examinees’ responses, assessing the exam’s quality and the effectiveness of the involved questions in measuring what they are designed to measure.
Read our article “ Why You Shouldn’t Overlook the Importance of Probing Questions in Teaching ”
What Are the Qualities of Good Assessment?
An assessment is a process through which students can share their educational experiences. In order for a test to be a good tool for measuring students’ knowledge and skills, it should have the following characteristics of examination that are essential for the success of any test.
Reliability or Consistency
Reliability or consistency of a test means that learners should perform the same or get the same score if they are exposed to different questions in different times and places. A test is considered reliable when the same result is achieved over different tests.
As James Carlson mentions in his research memorandum, “The reliability of test scores is the extent to which they are consistent across different occasions of testing, different editions of the test, or different raters scoring the test taker’s responses.” He also mentions some statistics to describe how a test can be reliable.
How to Make Sure Your Test Is Reliable?
Score Distribution: The percentage of test takers at each score level.
Mean Score : The average score, computed by summing the scores of all test takers and dividing by the number of test takers.
Standard Deviation : A measure of the amount of variation in a set of scores. It can be interpreted as the average distance of scores from the mean. (Actually, it is a special kind of average called a “root mean square,” computed by squaring the distance of each score from the mean score, averaging the squared distances, and then taking the square root.)
Correlation : A measure of the strength and direction of the relationship between the scores of the same people on two tests.
Reliability is the ratio of the true score and the observed score variance. To measure a test’s reliability, we may administer a test to the same group more than once.
However, errors may occur as students may forget or have some physical problems. Thus, it is crucial to administer the same test in identical conditions to ensure that we will get the same results.
Read “ How to Make an Online Exam for Students in Only Minutes ”
A validity of a test can be achieved when the test measures what it is really intended to measure. Therefore, a certain criteria must be selected.
Validity is very important to gauge the quality of a given test as questions must be in line with the selected criteria and measures.
Here are some of the top different types of validity:
Content Validity: A test should fairly represent the content of the course or the field of study.
Criterion Validity: It is used to predict the performance of a job applicant or a student.
Convergent validity : This is mostly used in the field of sociology or psychology.
Discriminant Validity: Discriminant validity means that a test of a concept is not highly correlated with other tests that are set to measure theoretically different concepts.
See how you can make objective, valid tests in education using Qorrect e-assessment system. Reserve a free Demo now .
Objectivity
According to Gronlund and Linn “Objectivity of a test refers to the degree to which equally competent scores obtain the same results,” the test should be away from any personal or subjective judgment. It should be based only on the evaluation of human development.
For example, in an essay-type test, students answer differently as each one has his/her own style of writing.
Hence, when more than one instructor check the test, they may give different scores according to whether they like the style or not. So, here, the test is less objective.
To avoid such bias, sharp rules should be set in evaluating such types of tests. There should be a unified guide for teachers to use while correcting such tests.
Personal judgment does not occur in true or false or multiple choice tests. Besides, teachers should receive training on how to score a test as untrained teachers may give wrong scores and not be able to maintain the required fairness and accuracy.
Read “ Standardized Tests: History and Implementation in Universities ”
Comprehensiveness
A test should fully cover the entire field of study that students are exposed to during the course. Vague questions should not be included especially during online tests when students are confused and short in time.
Absence of Ambiguity
There has to be no place for ambiguity especially in online tests where examiners are absent. Students should not be left in confusion and all questions have to be crystal clear.
According to Jacobs, Lucy C., from Indiana University, “ambiguous questions constitute the major weakness in college tests.
Ambiguous questions often result when instructors put off writing test questions until the last minute. Careful editing and an independent review of the test items can help to minimize this problem.”
Read “ What’s the Difference between Online Exam and Offline Exam ?”
Preparation
To ensure the success of any test, instructors should take into consideration the following factors:
Students have to be well-prepared for the test through extensive revisions and discussions.
There should not be any gaps between the revision period and the exam.
Examiners should make it clear to students which topics are expected to be tackled in the exam.
Students should be well-trained for the test type.
Appropriateness of Time
One of the top characteristics of a good test is when students have appropriate time to answer all questions. For example, essay questions require more time than multiple choice or true/false questions.
Some teachers take the test themselves first and then double or triple the time for students. A good test is supposed to be practical and comprehensive.
Our Conclusion of Characteristics of Good Test
There is a strong sense, however, that the use of the word ‘characteristics’ or ‘criteria’ is not optimal. It implies the development of standards against which assessments could be judged.
Instead, we believe there should be a general agreement that the word ‘framework’ captures our desire to create a structure that might be useful in the development of a good test in education more precisely.
7 Outstanding Characteristics of a Good Online Test
No logistic setback
Easy access from anywhere
High speed
Support essay questions, multiple-answer questions, short answers, & equation & scientific questions
Built-in questions bank in quality online test systems
Immediate students results reports are generated
Highly detailed, error-free analytics reports on students’ performance as well as test and questions quality
Characteristics of a Good Test with Examples
What is the purpose of a test.
It is an evaluation process through which examiners know who you are and what you know and think. They identify how you are different from others.
Types of Tests
Tests can be categorized into two types according to the questions they tackle:
Essay Questions Tests
This type is intended to gauge students’ information and knowledge of the field of study. It measures their writing skills and how well they are able to show their personality in writing.
There should not be anything to be memorized as students answer according to their understanding of the course materials.
Through this type of tests, instructors are able to measure students’ logical thinking and problem solving skills.
Read “ Proctored Exams: Here’s How to Successfully Do Remote Proctoring ”
Objective Questions Tests
Such tests are easy to be marked as they have true and false answers and hence they are away from any personal opinion or subjectivity. For example, true or false and multiple choice questions are objective tests.
In an article titled “ Harvard Courses Turn to Monitored Exams, Open-Book Assessments, and Faith in Students As Classes Move Online ” Juliet E. Isselbacher and Amanda Y. Su, The Crimson writers, showed experiences of different professors through the COVID-19 pandemic and how they were forced to switch to online learning.
Professor Robert N. Stavins decided to change the exam to be open-book so as to guarantee equality among students especially during the absence of any monitoring during the online tests.
Other professors preferred to keep the same old style of the closed-book exam ensuring that it is verified and monitored as professor Chaudoin said “We have to trust the students, and the online exam tools give us a partial way to monitor things.”
Iaura Rose Smith, from the University of Manchester, shared her experience in her article titled “ My Online Exam Experience and Top Tips for Students .” She made it clear that online examinations have changed her way of studying.
Instead of just thinking of passing a test, she focused on revision, knowledge, and real understanding of the course material. She said, “I would recommend using this opportunity to gain a deeper understanding of your subject area and expand your knowledge further than the curriculum.”
Verbs Best Used in Good Tests
Educators recommend using a set of key verbs related to Bloom’s theory, classification of educational objectives, when writing any type of test questions/test items.
This list of verbs guarantees that the teacher or test creator is indeed asking the right questions, equivalent to that of the students’ level of knowledge and understanding. Here are some of these verbs, according to California State University website.
Knowledge
Arrange – define describe – duplicate identify – label – list match – memorize name – order – outline recognize – relate – recall – repeat reproduce – select – state
With the exposure of online learning and the use of advanced software systems in education, most instructors had to change the traditional way of testing.
Traditional Assessment
Teachers used to measure students’ knowledge only by how they score in a given exam. They give students only one chance to show their competencies without discussions or classroom projects.
Online Assessment
Online assessment is a way through which teachers can improve students’ learning, knowledge, beliefs, and skills. Online assessments can be behavioral, cognitive, or communicative assessments.
Students may take the online assessment in the classroom or at home and this reduces their stress. New tools are now introduced for instructors to set different types of assessments.
They can use game-based assessments through many tools such as Kahoot, as mentioned in our previous article “ 11 Best Exam and Assessment Platforms of 2021 .”
Teachers can also create polls and activities. Moreover, Google Forms enable teachers to create and grade quizzes. They can choose multiple-choice quizzes or short-answer quizzes.
Some tools also provide teachers with excel reports of students’ grades and feedback can be sent easily to students directly after the exam. Many advanced software systems allow teachers to deliver reliable and cheat-free exams to students and grade them instantly. This saves a lot of time for teachers.
Qorrect (e-exams system) generates automated reports of the test results. To analyze the quality of the test, it provides feedback that no cheating happened during the online exam, and analyzes the performance of the students during the examination.
Read “ Summative Assessment Overview, Purposes, and Best Online Software ”
How Do I Write a Good Test?
Be specific
Do not use ambiguous questions.
Choose a suitable format for your test.
Avoid the open-ended questions unless you are willing to accept any answer.
Choose your words carefully and avoid any ambiguous language.
Students should know how much each of the questions are worth.
To conclude, teachers should create their exams away from any subjectivity, ambiguity, or lack of comprehensiveness. The appropriate format should be selected to match the course materials and to measure students’ knowledge and skills.
15 Things You Need to Know about the Characteristics of a Good Test in Education
Here are 15 tips the American Board concluded its workshop “ Modes of Classroom Assessment ” with:
Bloom’ theory of educational objectives classification, in which cognitive skills exist in a hierarchical order, is important in any assessment.
Assessment works better when they are ongoing and integrated into instruction as opposed to episodic and marginally referenced to classroom instruction.
Lots of faculty members use MCQs for their classes because they are able to cover much greater content and in a very short period of time; plus they’re known to be very easy, compared to other questions, and are quick to score.
Many professors and test managers prefer to use other types of tests to assess their students/examinees: essay, papers or electronic portfolios, projects, presentations tests.
Both views are not incorrect! However, a great teacher would use all of the previously mentioned assessment forms throughout the academic year.
The value of tests is much greater and more pronounced when they are performed as a part of a completely comprehensive program that’s designed to enhance learning, progress, performance, and the educational institution’s success
“A comprehensive assessment-instruction system should contain a variety of assessment techniques.”
A test can only test what it was formed to assess. So it’s up to the decision-makers to process the data generated from it.
Summative assessments are called “assessments of learning,” and formative assessments are called “assessment for learning.”
There should always be a balance between the intellectual skills being assessed.
Specific and descriptive instructional feedback that will help students to improve their learning and prepare for mastery of the curricular topics are central to effective formative assessments.
Frequent short tests are more instructionally helpful and provide better assessment data than infrequent extended exams.
Diagnostic assessments measure a student’s current knowledge and skills for the purpose of identifying a personalized program of learning for that student.
Quality assessments are valid, reliable and unbiased.
A test is no better than the quality of items it contains.
Are You Testing Students or Customers?
In their book “ The Trouble with Higher Education ,” Patrick Smith and Trevor Hussey had a unique outlook on the system of education in general.
The book linked the rise of consumerism with education, addressing the effects this now has on everything learning-related.
That includes how the system of universities works today and the high prices we face today in a lot of top of the world higher ed institutions and universities… too high some students stay in debt for years (although the book addresses education in the UK, a lot of other countries may relate to the issues raised).
It is a significant consequence of these changes that students have come to see themselves as customers . Increasingly their perception is that they are buying a product. This encourages an instrumental view of education: its value lies not in itself but in what it can be used to gain. An education that has to be purchased at great expense is purchased for a purpose, and that purpose is what it will earn. At the very least it must pay for itself.
Because of this we must start to raise the bar for quality education and testing. Because students or customers are now less willing to tolerate less quality education, teaching, facilities, testing, grading, & reporting.
Do you want to see in action how you can employ the characteristics of a good test in education for FREE? Reserve a free demo now .
Claim Your Free Trials
About Yasmine Nasr
Streamlining Assessment for Learning: How Qorrect Integrates with Online Learning Platforms
Incorporating technology in education is no longer just an option; it’s essential. Institutions are constantly …
Eberly Center
Teaching excellence & educational innovation, what is the difference between formative and summative assessment, formative assessment.
The goal of formative assessment is to monitor student learning to provide ongoing feedback that can be used by instructors to improve their teaching and by students to improve their learning. More specifically, formative assessments:
help students identify their strengths and weaknesses and target areas that need work
help faculty recognize where students are struggling and address problems immediately
Formative assessments are generally low stakes , which means that they have low or no point value. Examples of formative assessments include asking students to:
draw a concept map in class to represent their understanding of a topic
submit one or two sentences identifying the main point of a lecture
turn in a research proposal for early feedback
Summative assessment
The goal of summative assessment is to evaluate student learning at the end of an instructional unit by comparing it against some standard or benchmark.
Summative assessments are often high stakes , which means that they have a high point value. Examples of summative assessments include:
a midterm exam
a final project
a senior recital
Information from summative assessments can be used formatively when students or faculty use it to guide their efforts and activities in subsequent courses.
CONTACT US to talk with an Eberly colleague in person!
What is Essay? || Characteristics of A Good Essay || CSS || PMS
"English Essay Topic COMPREHENSION By Ms Saima Zaidi"
Types of Evaluation: Subjective and Objective, Formative and Summative
Essay type tests/ long answer type tests/ items/ questions @sparshclasses
COMMENTS
Essay Test vs Objective Test
Difference between Essay tests and Objective Tests. 1 - In essay items the examinee writes the answer in her/his own words whereas the in objective type of tests the examinee selects the correct answer from the among several given alternatives. 2 - Thinking and writing are important in essay tests whereas reading and thinking are important ...
Essay Test Preparation Tips and Strategies
Being able to identify and becoming familiar with the most common types of essay test questions is key to improving performance on essay exams. The following are 5 of the most common question types you'll find on essay exams. 1. Identify. Identify essay questions ask for short, concise answers and typically do not require a fully developed essay.
The Four Main Types of Essay
An essay is a focused piece of writing designed to inform or persuade. There are many different types of essay, but they are often defined in four categories: argumentative, expository, narrative, and descriptive essays. Argumentative and expository essays are focused on conveying information and making clear points, while narrative and ...
Exam Questions: Types, Characteristics, and Suggestions
Essay questions differ from short answer questions in that the essay questions are less structured. This openness allows students to demonstrate that they can integrate the course material in creative ways. As a result, essays are a favoured approach to test higher levels of cognition including analysis, synthesis and evaluation.
Essay Exams
You must be realistic about the time constraints of an essay exam. If you write one dazzling answer on an exam with three equally-weighted required questions, you earn only 33 points—not enough to pass at most colleges. This may seem unfair, but keep in mind that instructors plan exams to be reasonably comprehensive.
PDF PREPARING EFFECTIVE ESSAY QUESTIONS
workbook was developed to provide training and practice in discerning the often difficult. to see characteristics of effective essay questions and to support educators in the. development and use of essay questions. This workbook supports educators from all schools and disciplines. In addition, it.
Essay Tests
Essay Tests. There are basically two types of exams: Objective - requires answers of a word or short phrase, or the selection of an answer from several available choices that are provided on the test. Essay - requires answers to be written out at some length. The student functions as the source of information.
PDF Essay Exams: Common Question Types
Identify Essay Type This type of essay asks for short, direct answers and usually does ... "What are the main characteristics?" Key Words List Summarize Enumerate Describe Define State Explain Essay Type This type of essay should be fully thought out and developed in as
Essay Test: The Ultimate Guide with The Best Strategies
An essay test is a type of assessment in which a student is prompted to respond to a question or a series of questions by writing an essay. ... Identify the main topic or theme and any specific requirements or instructions. Then, create an outline to organize your thoughts and structure your response. Begin your essay with a clear and concise ...
PDF AN APPROACH TO ESSAY TESTS
the person who reads them. When you write answers to an essay test, one instructor might think your answers are worth an A, while another might think they are worth a C. STEPS TO TAKE DURING THE TEST 1. Read through each question once and carefully. Answers for some questions will come to mind immediately.
Objective tests
Introduction. Objective tests are questions whose answers are either correct or incorrect. They tend to be better at testing 'low order' thinking skills, such as memory, basic comprehension and perhaps application (of numerical procedures for example) and are often (though not necessarily always) best used for diagnostic assessment.
PDF Is This a Trick Question?
Essay and objective exams can be used to measure the same content or ability. TRUE Both item types can measure similar content or learning objectives. Re-search has shown that students respond almost identically to essay and objective test items covering the same content. 9. Essay and objective exams are both good ways to evaluate a student's ...
Objective or Subjective? Those are the Questions
1. Objective, which require students to select the correct response from several alternatives or to supply a word or short phrase to answer a question or complete a statement. Examples: multiple choice, true-false, matching, completion. 2. Subjective or essay, which permit the student to organize and present an original answer.
Advantages, Disadvantages of Different Types of Test Questions
Advantages. Save instructors the time and energy involved in writing test questions. Use the terms and methods that are used in the book. Disadvantages. Rarely involve analysis, synthesis, application, or evaluation (cross-discipline research documents that approximately 85 percent of the questions in test banks test recall) Limit the scope of ...
Essay Exams
Essay Exams. Essay exams provide opportunities to evaluate students' reasoning skills such as the ability to compare and contrast concepts, justify a position on a topic, interpret cases from the perspective of different theories or models, evaluate a claim or assertion with evidence, design an experiment, and other higher level cognitive skills.
Tips for Creating and Scoring Essay Tests
Restricted Response - These essay questions limit what the student will discuss in the essay based on the wording of the question. For example, "State the main differences between John Adams' and Thomas Jefferson's beliefs about federalism," is a restricted response. What the student is to write about has been expressed to them within the question.
Improving Your Test Questions
I. Choosing Between Objective and Subjective Test Items. There are two general categories of test items: (1) objective items which require students to select the correct response from several alternatives or to supply a word or short phrase to answer a question or complete a statement; and (2) subjective or essay items which permit the student to organize and present an original answer.
Writing Good Multiple Choice Test Questions
In addition, the objective scoring associated with multiple choice test items frees them from problems with scorer inconsistency that can plague scoring of essay questions. Validity: Validity is the degree to which a test measures the learning outcomes it purports to measure. Because students can typically answer a multiple choice item much ...
Multiple Choice and Other Objective Tests
Multiple Choice and Other Objective Tests. General Statements about Objective Tests. Objective tests require recognition and recall of subject matter. The forms vary: questions of fact, sentence completion, true-false, analogy, multiple-choice, and matching. They tend to cover more material than essay tests. They have one, and only one, correct ...
PDF Objective Tests
Compare and contrast. Sometimes objective questions can be used to test your ability to distinguish concepts, ideas, theories, events, facts from each other. Construct diagrams, charts tables, or lists to summarize relationships. 5. Recite for precision. Review your retention of the information by recalling it often.
Key Characteristics of a Good Test in Education in 10 Min
Appropriateness of Time. One of the top characteristics of a good test is when students have appropriate time to answer all questions. For example, essay questions require more time than multiple choice or true/false questions. Some teachers take the test themselves first and then double or triple the time for students.
Formative vs Summative Assessment
The goal of summative assessment is to evaluate student learning at the end of an instructional unit by comparing it against some standard or benchmark. Summative assessments are often high stakes, which means that they have a high point value. Examples of summative assessments include: a midterm exam. a final project. a paper. a senior recital.
IMAGES
VIDEO
COMMENTS
Difference between Essay tests and Objective Tests. 1 - In essay items the examinee writes the answer in her/his own words whereas the in objective type of tests the examinee selects the correct answer from the among several given alternatives. 2 - Thinking and writing are important in essay tests whereas reading and thinking are important ...
Being able to identify and becoming familiar with the most common types of essay test questions is key to improving performance on essay exams. The following are 5 of the most common question types you'll find on essay exams. 1. Identify. Identify essay questions ask for short, concise answers and typically do not require a fully developed essay.
An essay is a focused piece of writing designed to inform or persuade. There are many different types of essay, but they are often defined in four categories: argumentative, expository, narrative, and descriptive essays. Argumentative and expository essays are focused on conveying information and making clear points, while narrative and ...
Essay questions differ from short answer questions in that the essay questions are less structured. This openness allows students to demonstrate that they can integrate the course material in creative ways. As a result, essays are a favoured approach to test higher levels of cognition including analysis, synthesis and evaluation.
You must be realistic about the time constraints of an essay exam. If you write one dazzling answer on an exam with three equally-weighted required questions, you earn only 33 points—not enough to pass at most colleges. This may seem unfair, but keep in mind that instructors plan exams to be reasonably comprehensive.
workbook was developed to provide training and practice in discerning the often difficult. to see characteristics of effective essay questions and to support educators in the. development and use of essay questions. This workbook supports educators from all schools and disciplines. In addition, it.
Essay Tests. There are basically two types of exams: Objective - requires answers of a word or short phrase, or the selection of an answer from several available choices that are provided on the test. Essay - requires answers to be written out at some length. The student functions as the source of information.
Identify Essay Type This type of essay asks for short, direct answers and usually does ... "What are the main characteristics?" Key Words List Summarize Enumerate Describe Define State Explain Essay Type This type of essay should be fully thought out and developed in as
An essay test is a type of assessment in which a student is prompted to respond to a question or a series of questions by writing an essay. ... Identify the main topic or theme and any specific requirements or instructions. Then, create an outline to organize your thoughts and structure your response. Begin your essay with a clear and concise ...
the person who reads them. When you write answers to an essay test, one instructor might think your answers are worth an A, while another might think they are worth a C. STEPS TO TAKE DURING THE TEST 1. Read through each question once and carefully. Answers for some questions will come to mind immediately.
Introduction. Objective tests are questions whose answers are either correct or incorrect. They tend to be better at testing 'low order' thinking skills, such as memory, basic comprehension and perhaps application (of numerical procedures for example) and are often (though not necessarily always) best used for diagnostic assessment.
Essay and objective exams can be used to measure the same content or ability. TRUE Both item types can measure similar content or learning objectives. Re-search has shown that students respond almost identically to essay and objective test items covering the same content. 9. Essay and objective exams are both good ways to evaluate a student's ...
1. Objective, which require students to select the correct response from several alternatives or to supply a word or short phrase to answer a question or complete a statement. Examples: multiple choice, true-false, matching, completion. 2. Subjective or essay, which permit the student to organize and present an original answer.
Advantages. Save instructors the time and energy involved in writing test questions. Use the terms and methods that are used in the book. Disadvantages. Rarely involve analysis, synthesis, application, or evaluation (cross-discipline research documents that approximately 85 percent of the questions in test banks test recall) Limit the scope of ...
Essay Exams. Essay exams provide opportunities to evaluate students' reasoning skills such as the ability to compare and contrast concepts, justify a position on a topic, interpret cases from the perspective of different theories or models, evaluate a claim or assertion with evidence, design an experiment, and other higher level cognitive skills.
Restricted Response - These essay questions limit what the student will discuss in the essay based on the wording of the question. For example, "State the main differences between John Adams' and Thomas Jefferson's beliefs about federalism," is a restricted response. What the student is to write about has been expressed to them within the question.
I. Choosing Between Objective and Subjective Test Items. There are two general categories of test items: (1) objective items which require students to select the correct response from several alternatives or to supply a word or short phrase to answer a question or complete a statement; and (2) subjective or essay items which permit the student to organize and present an original answer.
In addition, the objective scoring associated with multiple choice test items frees them from problems with scorer inconsistency that can plague scoring of essay questions. Validity: Validity is the degree to which a test measures the learning outcomes it purports to measure. Because students can typically answer a multiple choice item much ...
Multiple Choice and Other Objective Tests. General Statements about Objective Tests. Objective tests require recognition and recall of subject matter. The forms vary: questions of fact, sentence completion, true-false, analogy, multiple-choice, and matching. They tend to cover more material than essay tests. They have one, and only one, correct ...
Compare and contrast. Sometimes objective questions can be used to test your ability to distinguish concepts, ideas, theories, events, facts from each other. Construct diagrams, charts tables, or lists to summarize relationships. 5. Recite for precision. Review your retention of the information by recalling it often.
Appropriateness of Time. One of the top characteristics of a good test is when students have appropriate time to answer all questions. For example, essay questions require more time than multiple choice or true/false questions. Some teachers take the test themselves first and then double or triple the time for students.
The goal of summative assessment is to evaluate student learning at the end of an instructional unit by comparing it against some standard or benchmark. Summative assessments are often high stakes, which means that they have a high point value. Examples of summative assessments include: a midterm exam. a final project. a paper. a senior recital.