ebook img

ERIC ED384672: Figural-Response Assessment: System Development and Pilot Research in Cell and Molecular Biology. GRE Board Professional Report No. 89-02P. PDF

27 Pages·0.56 MB·English
by  ERIC
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview ERIC ED384672: Figural-Response Assessment: System Development and Pilot Research in Cell and Molecular Biology. GRE Board Professional Report No. 89-02P.

DOCUMENT RESUME TM 023 993 ED 384 672 Jeffrey B. Martinez, Michael E.; Jenkins, AUTHOR Development and Figural-Response Assessment: System TITLE Molecular Biology. GRE Pilot Research in Cell and 89-02P. Board Professional Report No. Princeton, NJ. Graduate Educational Testing Service, INSTITUTION Record Examination Board Program. Board, Princeton, Graduate Record Examinations SPONS AGENCY N.J. ETS-RR-92-50 REPORT NO Jan 93 PUB DATE 35p. NOTE Research/Technical (143) Reports PUB TYPE MF01/PCO2 Plus Postage. EDRS PRICE *Educational *College Students; *Cytology; DESCRIPTORS *Molecular Biology; Assessment; Higher Education; Ability; *Test Pilot Projects; Scoring; Spatial Verbal Ability Construction; Test Items; Test Use; Record *Figural Response Items; *Graduate IDENTIFIERS Examinations; Open Ended Questions ABSTRACT Examinations (GRE) The purpose of the Graduate Record prototype computer figural-response project was to design a items and scoring figural-response assessment system for delivering biology and to begin to in the domain of cell and molecular This report describes investigate properties of the item format. continuous and effort that is intended to be progress to date in an Features of the delivery system are lead to program implementation. ancillary developments, such as a described, with sample items, and from a pilot research study are tutorial, are also noted. Findings to examine the described. The essence of the pilot study was (figural-response and open-ended relationships between two item types figural and verbal ability for verbal questions) and measures of (n=4). The students and professors undergraduates (n=17) and graduate items draw from verbal ability, data hint that whereas verbal and verbal ability. The figural-response items draw from figural of possible new directions for report concludes with a discussion use of the item format. research, development, and eventual program study findings. An appendix contains Two tables present some pilot (Contains 17 references.) sample computer test screen items. (Author/SLD) *********************************************************************** best that can be made Reproductions supplied by EDRS are the * from the original document. * *********************************************************************** RESEARCH IMP a A I 1 SI A. a 1 "PERMISSION TO REPRODUCE THIS MATERIAL HAS BEEN GRANTED BY U II DEPARTMENT OF EDUCATION (Moms Of Educational Research and Improvement EDUCATIONAL RESOURCES INFORMATION ar CENTER (ERIC) as his dOCument NM been reprOduced receive(' from the person or organization Originating it improve C MinOr changes have been made to TO THE EDUCATIONAL RESOURCES kl (production Quality INFORMATION CENTER (ERIC) Points or toes. or opinions staled in Int% Uocu othc.ai mint do not necessarily represent OERI posthon or pohcy I . Figural-Response Assessment: System Development and Pilot Research in Cell and Molecular Biology Michael E. Martinez and Jeffrey B. Jenkins GRE Board Report No. 89-02P January 1993 This report presents the findings of a research project funded by and carried out under the auspices of the Graduate Record Examinations Board. Educational Testing Service, Princeton, N.J. 08541 3 Testing Service are The Graduate Record Examinations Board and Educational dedicated to the principle of equal opportunity, and their programs, services, and employment policies are guided by that principle. EDUCATIONAL TESTING SERVICE, ETS, the ETS logo, GRADUATE RECORD Testing Service. EXAMINATIONS, and GRE are registered trademarks of Educational Copyright © 1993 by Educational Testing Service. All rights reserved. Abstract The purpose of the GRE figural-response project was to design a prototype assessment system for delivering and scoring figural-response items in the domain of cell and molecular biology, and to begin to investigate properties of the item format. This report describes progress to date in an effort that is intended to be continuous and lead to program implementation. We first describe features of the delivery system and give sample items; ancillary developments, such as a tutorial, study are described. The essence of the pilot are also noted. Then, findings from a pilot research study was to examine the relationships between two item types (figural-response and open-ended verbal questions) and measures of figural and verbal ability. The data hint that whereas verbal items draw from verbal ability, figural-response items draw from figural and verbal ability. The development, and report concludes with a discussion of possible new directions for research, eventual program use of the item format. 5 Introduction As a practice and as an industry, testing is simultaneously being viewed with criticism for its shortcomings and eyed expectantly for its potential to improve education. Perhaps in part because of this attention, important new developments in testing have begun to emerge. Among the most prominent of these developinents are behavioral anchoring (proficiency scaling), incomplete block sampling designs (Messick, Beaton, & Lord, 1983), testlets (Wainer & Kiely, 1987), and diagnostic models that are compatible with item response theory. (Tatsuoka, 1990). Further removed from traditional large-scale testing are portfolio and performance assessment. Another line of development looks to computer delivery of tests and compatible new technologies, such as computer adaptive testing (Reckase, 1989). Yet another line of research deals with constructed response items. Two of these developmentstechnology and constructed response itemsplay a role in the project described here. The figural-response item format, the focus of this study, is defined by two features: constructed responses and the expression of proficiency through the manipulation of figural (pictorial) material. Computer delivery, though used in this project, is not a required feature of figural-response assessment. Figural-response items present an examinee with a picture or diagram and ask the respondent to carry out some task on the figure. In the domain of biology, these tasks might include labeling particular structures (such as a cell nucleus) or assembling structures from components (such as an organic molecule from atoms). The range of items possible and their potential value to assessment is open and amenable to research. Constructed responses are often viewed as desirable, in part because they appear to reflect some target competencies much better than do multiple-choice questions. There is evidence that constructed-response items elicit cognitive processes that are qualitatively distinct from the kinds of thinking tapped by multiple-choice questions (Snow, 1980; Martinez & Katz, manuscript submitted for publication). The figural aspect is also important: Educators have argued that the dominant symbolic modes of formal education, including assessment, are verbal and logico- mathematical (Gross, 1974; Shavelson, Webb, & Lehman, 1986). These modes do not capture all possible ways of knowing, and in certain, visually oriented fields, communication of ideas in verbal form can distort their most direct and natural representation and hinder problem solving (Larkin & Simon, 1987). The potential applicability of figural-response items is likely to vary from domain to domain. The item type is especially suited to content areas that are highly visual or graphical, and the format may enable the assessment of knowledge that cannot be tapped by verbal or quantitative representations or by more static means of testing. Biology, because it is so visual, invites this form of assessment, but assessment in other subject areas, such as engineering, might also be enhanced by the inclusion of figural-response items. Even in fields that are not predominantly graphical, it seems likely that figure-based assessment could draw upon understandings that are tapped poorly or not at all by other assessment forms. One can imagine asking a student to place key events on a timeline to demonstrate an understanding of event precedence and causality in history. For the researchers involved, the motivation behind this project was a belief that items calling for constructed responses within a figural medium fill a gap in assessmentand also in instruction. What remained to be seen was, given the self-imposed constraints on the item type, whether tasks generated would have at least a face validity and appear to add value to assessment when combined with more typical kinds of questions. Apart from many technical challenges, the potential research issues, revolving mostly around validity, are many and of great practical importance. Finally, technology was an important aspect of the project because automated scoring was presumed to be virtually a prerequisite for large-scale use of the item format. 1 fi Project 13k;. Jcground and Overview developed for the National Assessment In its first instantiation, figural-response items were printed on paper. From the beginning of the of Educational Progress science assessment and This technology was developed, but not to the project, there was an interest in automated scoring. needed for program testing (Martinez, Ferris, point where it could be used with the reliability work was proposed, computer delivery of items was Kraft, & Manning, 1992). When the current collect the kinds of responses recommended for two main reasons. The first is that computers can assembly of structures from components). A possible with paper and pencil, plus more (including of paper-and-pencil scoring are no longer second advantage is that some of the technical problems problem of locating a response on the problems with the computer. A ready example is the because variations in sheet feeding and graphic. With paper-and-pencil delivery, this was not easy of a mouse shrinking made the process less sure. Finding the location paper imperfections and determining the location of any object or the click on a computer is trivial by comparison, as is for selecting computer delivery is that the beginning and ending points of a line. A final reason for at least the verbal, quantitative, and GRE program was headed steadily in this direction analytical sections of the General Tests. development aspects of our work The project was primarily a development project; hence, research This does not downplay the importance of future research; are emphasized in this report. by any new item fornu c. The report is is needed to shed light on the meaning of what is measured results of pilot research. The development organized around the products of development and the essential features of the delivery portion is in a sense archival: The intent is to document the something of a chronicle of our progresseven, and system. Another function is to provide less-than-straight path might be of use perhaps especially, our missteps. An understanding of our ourselves of potential pitfalls in the development of a new to other researchers and reminders to the path and products of development, research assessment technology. Following a section on of a number of possible research from a pilot study is presented. The pilot study focuses on one aptitudes. The report concludes with a perspectives, namely, connections between item format and GRE program testing. discussion of the potential contribution of figural-response assessment to System Development (FRAME) Figural Response Authoring and Measurement Environment delivery system for The most significant product of the GRE Figural Response Project is a section is to provide an figural-response items, which we call FRAME. The purpose of this passing, it is worth noting that overview of the most important features of the delivery system. In constructed fairly rapidly, within six months of the outset of the a functional delivery system was the project. These project. Refinements to the delivery system continued over the life of professionals in the field modifications were based on suggestions given by research subjects and confident that the delivery vehicle is well of interface design. Feedback we have received makes us experience with computers. designed and easy to useeven for someone who lacks significant in FRAME Version 2.0 presents a user with two types of displays: (a) a navigation screen, which shows the item which a list of items and their statuses are reported and (b) an item screen, and the tools needed for answering the item. stern, the figure on which the response is made, respectively. All system input is Sample navigation and item screens are shown in Figures 1 and 2, the pilot research, verbal made through a mouse. The only exception is when, as in the case of Incidentally, this illustrates (typed) responses are called for, in which case the keyboard is used. general assessment vehiclefor that the figural-response delivery system can be used as a multiple-choice, verbal response, and figural-response questions. d d d d d 6 d d d d d d d d d d d d d d d e e e e e 1 e e e e e e e e e e e 9 e e e e t t t t t t t t t t t t p p t t p p p t t t 0 p p p t t p p t t p p p p p p p p p p m m m m m m m m 0 m m m m m s m m m m m m m : u e e e e e e e e 0 e e e e e e e e e e e e t t t t t t t t t t t a t t t t t t t t t t t t t t t t t t t t A A t t t A A A t t t A A A t t A A t t t A A A A A A A A A A S e t t t t t t t t t t o o t t o o o t t t m o o o t t o o t t o o o o o o o o o N N N N N N N N N N . N N i N N N N N N N N t n t a R R R R e R R R R R R R R R m R R R R R R R F F F F d F F F F F F F F F F F F F F F F u r o t q F : D I i m i N B B E d d E n n R a a C . A A n S o s s N i e e t O s m m I e T u y y A q z z G n n s a A I V e e s N A r e o D o f f N d r i i s s c 2 s s n y s d i 4 1 s * e e h o o d a e 1 0 , y e e t t 4 t q r d i i i i r t r l c n 4 g s s d , u o n b e n n i e l a b c y t t n g g a i c s n n i c s r h y r c a r n n e t a e o o n o o e n l i S l i i d g d b e t i t i c a t t e c t t e p t t o i i u u e : u u y 2 u : b b : f i l e : e 1 m r e q l c o l l c c q e i * i g b c g r h h r * y e v e a v - u s a n - e n e e n C w s - r e L s r e o u u L o u d i h h i u d e d D D q q d k i t k c d o t t s c t t o c i i c s i s i c c c v i c i e e c s c d u n c s n a u n a a e a s o t t t s t e r A y o t o b a a r t t t b o r i t e i p e e o t e t l s i o s s d i c c o s d c t i a t l c o n l n t s i n o n p e a i i p n e A c t e a d d i i i e n n o i t t r e e m i i p o y r e m o r N i c r r n n t u u M u t C M t F l x C T i i T F e i P G A i C I I A Q D P T P T E l 9 7 8 6 1 e 3 0 2 5 4 0 9 8 7 4 5 1 6 3 2 1 1 1 1 S 1 2 1 1 1 1 1 , : 1 D ' E D o T E P i B K M S W W S R E M E E M E T T I A E I V O E V R T E T M E T N A R G I R I " 2 s 3 t s : n 0 d e 0 e s r : a e d s s 0 s p s e d s d e l e e e o p h d s e e b e t y o w e s m m s t n B o o l w 0 i e y T l n k o n d s . e _ l s e n l e n r d e e i l h G e u n Y l r e r P o e a w y g t r l l n e a a e r e = = = = a t d . n P k u u e r Y c y R q t d i s s l c n : t ' e d t D e p n I n e a . n d d u t n e c P i t e p o j e b . m w n h o o e t t e i t s g t t a h A e n r t v i e r s t n l o o n U o N v e . n s g ) : i r y s Y l i t Y F e u a r h t r h e a R s t h t s S d e t o e p f r o y c y m R 0 t e e o r 2 p o n h y f f e t o r t e g n o 0 0 O ' p ( 1 o n Y 0 i r e R s t o i h i s l o p s o l t c s p n d o y y Y Y n d e , r R t n U R t r c c r c a e p e d j r e p b i f r p x o b a e y n h h e e s i h d e r t o a v e f o o t , i s e s m b u l t p n o o m a g T l y o p z c a o , e r T W d R e X E p E E e : K E T t . V I E R e V A N d R g O o A g E G O h E i f M R n i - v T B I S r g V V R P i A o e A s E R A E r r R N U E T p a R P S G = 0 Navigation screen. The navigation screen provides the user with a list of items that can be selected and attempted in any order. At the top is an administration line which lists the name of the line, and above the list test, the ID of the examinee, and the elapsed time. Below the administration of items is the stem area. When an item is selected, the verbal instructions to that item are displayed in the stem area. The main window on the display shows, for each item, its number, a verbal descriptor for the item, the format of the item (such as FR for figural-response or MC for multiple-choice) and the status of the item (i.e., Attempted, Not Attempted, or Marked, which is explained below). If the number of items exceed: the display capacity of the screen, arrows for scrolling or paging are shown beneath the list of items. At the left are buttons that, when selected, automatically display only those items that have the characteristic shown on the button. For example, if the first button is chosen, only items that have the status Marked for Review will be displayed when items are viewed in sequence. Below the list of items is a context-sensitive help line that shows very simply and in general terms what the user is to do next, based upon the previous step. The button on the lower left, Exit Exam, permits the examinee to quit the test. Item screen. At the top of the item screen display is an administration line that shows the of the item, the ID of name of the test, the name of the item, item number out of a total, the status the examinee, and the elapsed time. Below that is the item stem, where the verbal instructions to the item are given. The size of the box shown can accommodate five lines of text, which has been sufficient for all items we have created so far. Limitations on stem length are actually desirable because we wanted to keep the average response time per item to about 2-3 minutes. The stem area accordingly. The largest area of can accommodate long verbal instructions, if needed, and expand the screen we refer to as the work area. This contains a figure that is manipulated or modified. If objects are to be moved around the screen, these are usually placed on the right side of the work area for the sake of consistency. Figures are bit-mapped images stored in a file separate from the delivery system. On the left-hand side of the screen is a row of buttons. The buttons at the top are tools used to respond to the item. The tools shown in Figure 2 are the Move Object and Erase tools; other tools -re Draw Line (straight), Draw Line (free-form), Draw Arrow, Rotate, and Label. Only the tot Is needed to answer each question are provided with that item. At the onset of the project, we were not sure what set of tools we would ultimately have. This small set of tools is extremely flexible in the kinds of tasks it can facilitate. The lower buttons handle administrative functions. For example, the Start Over button will redraw the current item, which is especially helpful if the examinee gets off to a bad start in answering the item. The Mark for Review button is useful if, after answering an item, the examinee wishes to return to that item at some later time. The new status, Marked, will then be shown in the column marked Status on the navigation screen. The Marked item is unmarked simply by clicking again on the button. Below that is a split button that will either advance to the next item or return to the previous item according to the order shown on the navigation screen. The last button, marked Navigate, returns the user to the navigation screen. Software and hardware. The figural-response delivery vehicle was programmed in EASIS, Research Group and used in a C-based programming language developed by ETS's Technology building the National Council of Architectural Registration Boards (NCARB) simulations prototype and NCARB figural-response items. Because of the computational demands of scoring figural-response items real-time, Borland's C is being used to develop the scoring system. Object- oriented C++ is also being used to increase the efficiency of scoring and to improve the transportation of code between related projects. The hardware platform requirements consist of an IBM-compatible 286 microcomputer, a high-resolution VGA (640 x 480) graphics display, and a of a math coprocessor mouse. A 386-based micro is recommended. FRAME will take advantage if available. 1 2 5

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.