ebook img

ERIC EJ904035: Accuracy and Efficiency of Data Interpretation: A Comparison of Data Display Methods PDF

2008·1.6 MB·English
by  ERIC
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview ERIC EJ904035: Accuracy and Efficiency of Data Interpretation: A Comparison of Data Display Methods

Accuracy and Efficiency of Data Interpretation: A Comparison of Data Display Methods Elizabeth Lefebre, Michael Fabrizio, and Charles Merbitz The Chicago School of Professional Psychology Although behavior analysis relies primarily on visual inspection for interpreting data, previous research shows that the method of display can influence the judgments. In the current study, 26 Board-Certified Behavior Analysts reviewed two data sets displayed on each of three methods—equal-interval graphs, tables, and Standard Celeration Charts—and determined which data set showed more change per unit of time and which showed less variability. Regardless of their preference for one method of display, participants handled data displayed on the Standard Celeration Chart most efficiently. However, the accuracy of their judgments across all types of data display was low and may indicate the need for ad- ditional training, even for persons holding the Board-Certified Behavior Analyst credential. DESCRIPTORS: data analysis, graphic displays, Standard Celeration Chart The Importance of Data-Based Decision Making and ronment (Behavior Analysis Certification Board®, Prediction 2004). Behavior analysts must also be familiar with The experimental analysis of behavior rests the characteristics of the client’s behavioral data so on baseline logic, a form of inductive reasoning they can both easily predict future performance (Cooper, Heron, & Heward, 1987)1 that rests on and quickly detect potential problems with pro- an assumption that if a participant responds at a gramming or other environmental factors. Further, steady state under a set of conditions, changes in they must easily recognize features of their client’s the dependent variable that occur as a result of data that may hold important clinical implications, changes in the independent variable become easier such as the presence of normal or abnormal levels to detect. As more and more baseline data are col- of variability, and also the speed with which a cli- lected, predictive power increases until a stable ent acquires new skills. For example, if a client’s state of responding occurs (Baer, Wolf, & Risley, responding during a new intervention becomes 1968). As research progresses, the experimenter extremely variable, the behavior analyst should makes predictions about the participant’s respond- be able to look back at previous response patterns, ing, and future data either verify or refute those detect any difference in variability, and make ap- earlier predictions. When the conditions change propriate changes to the program. Prediction based and the participant’s responses change accord- on data also allows changes in programming so a ingly, the experimenter compares changes in the client will meet intervention goals in an efficient participant’s responding under the new conditions manner. with responding predicted had the old conditions stayed in effect. Common Methods of Prediction Just as prediction plays an important role Experimenters use two types of tools to make in research, it plays an equally important role in predictions: line graphs and statistics (Cooper et al., clinical intervention. Behavior analysts must know 1987). In a statistical approach, an experimenter how their clients respond to changes in their envi- or behavior analyst derives an equation for a line that best fits the data set and then uses that linear This study was completed in partial fulfillment of the requirements for the degree of Master of Arts in Applied equation to predict where the next data point(s) Behavior Analysis by the first author. The authors thank should fall. From these predictions, the behavior Behavior Research Company for granting its permission to analyst then makes ongoing decisions based on the reproduce the Daily per Minute (Dpmin-12EC) Standard Celeration Chart. Correspondence about this study should 1Although there is a newer edition of Cooper, Heron, and be sent to: Michael Fabrizio, 1110 24th Avenue South, Se- Heward’s book (2008), the text cited in this manuscript attle, WA 98144-3037. comes from the earlier edition (1987) of the book. 2 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20 degree of alignment between prediction and actual important role in the decisions made for a client, observations. When using a graphic analysis, an behavior analysts and researchers must choose a experimenter or behavior analyst does not rely on display that most clearly represents the behavior of mathematically calculated values for predictions, interest as it occurs naturally in the environment. but instead plots data on a graph and makes predic- tions from the current data about the future data Commonly Used Types of Graphic Displays by relying primarily on visual inspection of such Tufte (1983) described the purpose of graphic features as the data’s level, trend, and variability. displays as showing data and inducing the viewer The Importance of Visual Analysis Tools and Proce- to pay attention to the data themselves rather than dures to other features of the display. According to Tufte In applied behavior analysis, visual analysis (1983), displays should avoid distorting what the of graphic displays is the most common method data say and should encourage the viewer’s eye to used to evaluate the effects of changes in indepen- compare different parts of the data with one an- dent variables (DeProspero & Cohen, 1979). Visual other. Behavior analysts have tended to use three records and displays of data validate whether clini- main types of data displays: (1) line graphs, such as cal interventions are associated with the intended, equal-interval graphs or semilogarithmic charts; (2) positive differences in a client’s responding. Re- data tables; and (3) cumulative records (Cooper et searchers and practitioners can also see whether al., 1987). Of these three types of displays, behavior they have isolated and controlled important vari- analysts rarely use cumulative records, and instead ables affecting the behavior of interest; they can also most commonly use equal-interval line graphs respond to changes in environmental variables as (Cooper et al., 1987). those changes occur, and in cost- and time-efficient With equal-interval graphs, also called ways (Parsonson & Baer, 1978). add-subtract graphs, both the vertical and horizon- In applied settings, data should drive a feed- tal axes are divided into equal distances between back loop in which changes in client performance marks to denote the addition or subtraction of con- occasion changes in the intervention or reinforce stant amounts. These graphs display some dimen- maintenance of the current conditions. In such a sion of the dependent variable on the vertical axis feedback loop, data should serve as stimuli that (typically, percent correct or frequency of respond- occasion changes in the behavior analyst’s behavior ing) and some dimension of time (typically, days to make changes in procedures when warranted or weeks) or an analogue of time (sessions) on the and to predict future outcomes more accurately. horizontal axis (Cooper et al., 1987). Because of their Given this important role in clinical intervention, mathematical properties, equal-interval graphs data should be collected and displayed in a way tend to produce curved rather than straight lines that allows behavior analysts to make fast, easy, to display human learning data. Because equal-in- and accurate decisions (Johnston & Pennypacker, terval graphs tend to produce curved lines when 1993). they display learning data, those curved lines likely To facilitate fast, easy, and accurate decision negatively affect the behavior analyst’s ability to making based on graphed data, important dimen- accurately and quickly predict future performance sions of the data such as trend, level, and variability or to easily compare variability across the full range should be readily distinguishable. Data displayed of responding. graphically should enable behavior analysts and Semilogarithmic graphs, such as the Stan- researchers to see patterns in the data and to make dard Celeration Chart (SCC) developed by Lindsley comparisons across changes in treatment condi- in the 1960s, display frequency measures along tions, since such comparisons have proven more the vertical axis. Celeration refers to change in useful in influencing interpretive reactions than frequency of behavior across time and is depicted statistical analyses (Johnston & Pennypacker, 1993). on the SCC as the trend in the data. In contrast to Given this reliance on visual rather than math- equal-interval graphs, the vertical axis of the SCC ematical properties of the data, the type of graphic has a multiplicative scale; that is, equal distances display selected may play an important role in the on the graph are equal ratios instead of additive analysis. Because the type of graph chosen plays an distances. A doubling of the response frequency 3 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20 would cover the same vertical distance on the SCC large changes in trend and level, but they did not regardless of the base frequency. Thus, data that focus on comparable changes in variability across grow from a frequency of two per minute to four phases. The researchers suggested that variability per minute would cover the same vertical distance should be defined clearly in the literature to clarify as data that grow from a frequency of 500 per min- the guidelines for its use as an interpretive dimen- ute to 1,000 per minute. On the horizontal axis, the sion of visual analysis. The scaling transformations SCC displays units of real calendar time such as also altered participants’ decisions about experi- days, weeks, or months. When data are plotted on mental control demonstrated on the graphs, since the SCC, the multiplicative scale of the ordinate the transformations seemed to be correlated with shows proportional change in responding. Because a change in response. This prompted a suggestion it reliably produces a straight line with human that experimenters should pay closer attention to learning data (Lindsley, 1991), standard trend (or scale when analyzing their data. celeration) measures can be calculated and drawn Mawhinney and Austin (1999) examined easily and quantified visually. This is important the speed and accuracy with which an expert in because changes in trend are often a major factor each of three types of visual displays identified the in an analysis of data (DeProspero & Cohen, 1979). onset of intervention: equal-interval graphs, SCCs, With the SCC, behavior analysts may be able to and Statistical Process Control (SPC) Charts. These detect important changes in a client’s performance experts reviewed data sets and plotted the data more easily than with other graphic displays that on their preferred chart, point by point. Accuracy, do not consistently produce straight lines. as measured by the expert’s ability to identify the location of the onset of an intervention, was higher Previous Research on Visual Analysis of Graphed for the data plotted on traditional equal-interval Data graphs (63% correct), followed by SCCs (32% cor- Stevens and Savin (1962) replotted data from rect), and then SPC charts (25% correct). However, a selection of cumulative learning experiments the SCC proved the most efficient method of data onto graphs that had logarithmic scales on both analysis because the SCC expert spent significantly the ordinate and the abscissa. They found that all less time completing the tasks than either the 8 data sets created a straight line when plotted on equal-interval or the SPC expert. The SCC expert the log-log graph, and a line of best fit could easily took less than half the time to plot the data than be fitted to the data by eye without the need for a did the equal-interval or the SPC expert, although statistical procedure. Although log-log graphs can the SCC expert’s correct response frequency was produce very straight lines and can be very help- lower. These findings along with the others suggest ful for predicting future behavior, such graphs are that the different display methods used to analyze rarely used in clinical practice. data may have an effect on the viewer’s ability to Furlong and Wampold (1982) assessed the accurately interpret data. ability of experts trained in single-subject design to sort graphed data into groups showing similar ex- The Current Study and Its Goals perimental effects. The researchers transformed the Similar to the study of Mawhinney et al. graphs in one of three ways. One group of graphed (1999), in the current study, the efficiency with data underwent scaling transformations in which which behavior analysts make accurate data-based the vertical axis was stretched. The second group of decisions was assessed with equal-interval graphs, graphed data underwent transformations in vari- semilogarithmic charts, and data tables. The previ- ability in which the standard deviation of random ous research points to the importance of studying deviates was multiplied by the same constant. Such the variables that affect interpretive behavior. This a transformation produced data with larger, more is critical because behavior analysts make most of random levels of variability. The third group of their decisions via visual analyses of data. The cur- graphed data underwent a standard transformation rent study sought to determine the accuracy and in which a randomly selected deviate was added efficiency of making data-based decisions across to each value in the data pattern. Furlong and different display types and the influence of pref- Wampold (1982) found that the raters attended to erences for and experience with certain types of 4 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20 graphs on participants’ ability to make data-based graphs as their primary method of interpretation; decisions. 2 participants reported that they used data tables; 5 participants reported that they used SCCs; and 3 METHODS participants reported that they used all three dis- play methods for interpretation. This information Participants and Setting allowed participants to be placed into groups based A total of 26 Board-Certified Behavior Ana- on their affinity toward one type of display. lysts who held a minimum of a master’s degree After completing the survey, participants participated in the study. Participants were selected indicated which display method they preferred through the certificant registry on the Behavior and which display method they found easiest to Analyst Certification Board™ web site using a se- interpret. As to their preference, 12 participants lection procedure in which every ninth certificant in reported that they preferred equal-interval graphs, each state within the United States received an in- 1 participant preferred tables, and 13 participants vitation to participate. Because each certificant had preferred SCCs. As to ease of interpretation, 10 to be individually invited to participate, it was not participants reported that SCCs were the easiest feasible to invite all certificants (greater than 3,000 method, 15 participants reported that equal-interval at the time this research occurred) in the registry. graphs were the easiest, and 1 participant reported These 26 participants responded from a pool of 446 that tables were the easiest. who were invited. For those potential participants who did not respond to initial invitations, a maxi- Materials mum of two follow-up e-mails were sent to them A Web site designed by the first author reminding them of the opportunity to participate. served as the stimulus presentation and data-col- This pool of professionals was selected because they lection vehicle for this study. The Web site collected all had previously met the minimum requirements data both on responses selected and on response for certification by the Behavior Analyst Certifica- times for each participant. This Web site led par- tion Board™ and, as such, likely had both training ticipants through a number of pages, including and experience in analyzing data derived from be- a letter of agreement, an informational question- havior analytic interventions. Participants received naire, several pages of instructions, and specific invitations via e-mail, and they logged into the Web tips to remember when answering questions. The site to participate at their convenience. questionnaire included questions about the par- ticipants’ experience in the field, the degree held, Demographic Information About the Participants their number of years practicing behavior analysis, Demographic data collected from partici- and the type of data display they most commonly pants included information about their experience used. The survey began after both the informational in the field, the highest educational degree they questionnaire and the instructions, and it consisted held, the number of years they had practiced be- of graphic displays or data tables depicting 15 sets havior analysis, and the type of data display they of data, each graphed on an equal-interval graph most commonly used. Participants also indicated and an SCC, as well as in tabular form. The SCCs which data display (equal-interval graphs, SCCs, were constructed with an SCC template for Mi- or data tables) they thought easiest to interpret and crosoft Excel (Harder, 2008); equal-interval graphs which display they preferred. were developed with Microsoft Excel’s scatter plot All participants in the study held certificates graphing function; and data tables were created in as Board-Certified Behavior Analysts for a median Microsoft Excel. of 3 years, with a range of 2 months to 7 years. The data sets used in the study consisted The median number of years participants had ex- of both real data sets and modified data sets, all perience in interpreting data was 8 years, with a depicting AB designs. Of the 15 data sets, 6 were range of 3 to 30 years. Of the participants, 16 held real data sets collected within early intervention ap- a master’s degree and 10 held a doctoral degree plied behavior analysis programs for children with as their highest level of education. Of the 26 par- autism and overseen by behavior analysts, and 3 ticipants, 16 reported that they used equal-interval of the data sets were generated by college students 5 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20 as they performed flashcard timings designed to they were to act as a behavior analyst in a clinical teach them verbal relations important to graduate setting. The vignette asked participants to make coursework they completed. The remaining 6data data-based decisions about the trend and variability sets were modified versions of 9 additional real data of the data they would see on the following pages. sets. The data sets were modified by a change either At the bottom of each page of instructions, a “next” in the length of the observation or in the frequency button appeared that enabled the participant to of certain data points in the data set to increase or advance to the next page. The vignette read: decrease the variability or to modify the slope. These data sets were modified to provide a range You have been hired at a central office to of stimuli that best represented how the different consult on several different behavioral types of displays show the same data sets differ- programs running at different sites. The ently. The modified data sets were changed so that staff use different data collection systems the slopes and variability of the two data paths and display their data for interpretation differed by a ratio of at least X1.25 on an SCC. A on different types of visual displays. You difference in celeration and variability of at least will be looking at pairs of data sets from X1.25 appears to be a readily detectable difference these programs. On each page, panel A between two sets of data plotted on the SCC. Each and panel B show the results of two be- display consisted of two data paths, displayed in havior improvement procedures for a cli- two panels (Panel A and Panel B) on the graph. ent. Your decision will be to recommend Each data path differed from the other by the slope Procedure A, Procedure B, or either one of the trend line, the variability, or a combination for future work with that client, based on of both. which procedure has been more effective/ efficient overall. Staff members have also Dependent Measures complained about client inconsistency, so The dependent measures for the study were you will also select the panel showing the the participants’ frequency of correct and incorrect least variability, or indicate that the vari- identification of differences in trend and variability ability is the same. Finally, in some cases per minute, along with interresponse time. Inter- you may need more information to make response time was calculated to determine the time these decisions, so for either question you each participant took to respond to each stimulus. may indicate that there is not enough A linear regression equation was used to calculate information to make the choice. the slope of each line created by the data in each panel. The display showing the most success in in- The next page following the vignette read: creasing the behavior of interest (the steepest trend) determined the accuracy, and the panel showing a It is important to remember the follow- greater slope became the correct answer. For de- ing when making your decisions: Both tecting amounts of variability, the range that the procedures took the same amount of staff data deviated from the slope was calculated. The time/resources per session. In each proce- panel showing a smaller variability value became dure, the client reached the goal that was the correct answer. set. In some pairs, the client may have Procedure started at different levels with the two The Web site led participants through a se- different procedures. With some data sets, ries of pages before they accessed the data sets. The clients may have had unusually good or first page included the introduction, explanation, bad days that are probably not associated and purpose of the study as well as a letter of agree- with the procedure. Now staff members ment and experimenter contact information. After are working to increase or decrease other reading the introduction and agreeing to partici- behaviors, so you are making important pate, participants proceeded to the informational recommendations for future work. questionnaire. After completing the questionnaire, participants then read a vignette explaining that Following the instructions, the first page 6 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20 of the survey appeared. The participants saw the display appeared and the questions repeated. equal-interval graphs first, with data sets presented in random orders, followed by the data tables, and Accuracy Calibration then SCCs also in random order. Each display ap- Accuracy was calibrated between the first peared on two pages, making the survey a total of author’s assessments and the Web site’s server for 92 pages, including the 15 data sets that each ap- both the correctness of answers selected and the peared two times across the three different display time taken to answer each question. Accuracy of types. recording the selection of the same answer was The first page (Figure 1) included a graphic calculated from the total number of agreed-upon display and the following question: “If the goal was responses divided by the total number of possible met for each procedure, which procedure was most suc- responses. Accuracy scores for recording the se- cessful in increasing the behavior in the quickest, most lection of the same answer were 100%, and 92% economical way?” of time intervals were within 3 seconds of each Each page had four options for answering other. Here, differences in the Internet connection the question: “Panel A,” “Panel B,” “Either One,” speeds may have accounted for the differences in and “Not Enough Information,” as Figure 1 shows. frequencies collected. Because this study sought When the participant clicked on an answer, the primarily to determine which of three types of Web site recorded the time and advanced to the data displays allowed more accurate and efficient next page. data-based decision making in clinical situations, a This next page included the same graphic time difference of less than 3 seconds would likely display and the question, “Which program shows less not translate into clinically significant differences variability?” The same options as on the previous in ease of interpretation. page were offered for the participant to answer the question. Each page also included a button entitled RESULTS “Review Instructions” that opened up a separate page with the instructions previously presented. Response Frequencies-Rate of Change-All participants Following the page with the second question, a new With all participants considered, the correct Figure 1. The display participants viewed as they moved through the survey. 7 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20 _ _ _ _ _ _ 0 median percentage of correct and incorrect responses ns about both rate of change and variability. orrect Response Rates Across Display Methods Table Standard Celeration Chart Correct Incorrect Correct Incorrect 1.44 1.52 2.54 2.09 1.25 1.22 2.44 1.99 0.29 – 3.11 0.37 - 3.53 0.62 – 6.28 0.64 – 4.19 47% 48% 57% 42% 0 – 67% 0 – 80% 20 - 80% 20 – 67% ____________________________________________________________ 2.04 4.21 3.72 7.87 1.78 3.75 3.33 7.40 0.49 – 5.71 0.79 – 15.71 0.81 – 9.79 2.69 – 20.0 33% 63% 33% 66% 0 – 67% 0 – 93% 7 – 60% 40 - 93% d io nc __ dian, range, anhods for quest Correct and I al Graph Incorrect 1.78 1.78 0.35 - 3.16 53% 33 – 73% _____________ 3.50 3.50 0.16 – 8.14 53% 0 – 73% memet erv __ t _ able 1. Mean, cross display Equal In e Correct 1.45 1.39 0.41 – 3.00 45% 20 - 67% ____________ 2.71 2.48 0.30 – 6.10 40% 0 – 73% Ta g _ n _ a s _ s Rate of Ch Mean Median Range Median Percentage Range of Percentage Variability Mean Median Range Median Percentage Range of Percentage Figure 2. Overall correct and incorrect response frequencies for rate of change and variability → across equal-interval graphs, tables, and SCCs. Each dot represents one person’s correct frequency and each x one person’s error frequency. 8 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20 9 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20 Table 2. Mean, median, range, and median percentage of correct and incorrect responses across display methods for the equal-interval affinity group for questions about both rate of change and variability. Correct and Incorrect Response Rates Across Display Method for the Equal Interval Affinity Group Equal Interval Affinity Group Equal Interval Standard Graphs Tables Celeration Charts Rate of Change correct incorrect correct incorrect correct incorrect Mean 1.32 1.82 1.45 1.61 2.44 2.08 Median 1.28 1.75 1.24 1.38 2.10 1.96 Range 0.41 - 2.35 0.35 - 3.16 0.35 - 3.16 0.69 - 3.53 0.62 - 6.28 0.64 - 4.19 Median 40% 57% 47% 53% 53% 47% Percentage Range of 20 - 67% 33 - 73% 0 - 60% 0 - 80% 13 - 80% 20 - 60% Percentages Variability Mean 2.41 3.5 2.10 4.58 3.31 8.24 Median 2.11 3.19 1.78 4.20 3.18 7.07 Range 0.97 - 6.10 0.16 - 8.14 0.86 - 5.71 1.18 - 15.71 0.81 - 7.35 2.69 - 20.00 Median 37% 60% 27% 67% 33% 67% Percentage Range of 27 - 73% 27 - 73% 0 - 60% 0 - 80% 7 - 40% 40 - 93% Percentages median response frequencies on the rate of change equal-interval graphs, and then data tables. were higher on SCCs, followed by equal-interval graphs, and then data tables. Figure 2 and Table Response Frequencies-Variability-All participants 1 show these results. The SCCs also had a higher Correct response frequencies on variability percentage of correct responses, followed by tables, were also higher on SCCs (as Figure 2 and Table 1 and then equal-interval graphs for questions about show), followed by equal-interval graphs, and then rate of change. Table 1 shows these results. Error data tables. Error frequencies were also high across frequencies were high across all display methods, all display methods, with SCCs having the high- with SCCs having the highest, followed closely by est error frequencies, followed by tables, and then Figure 3. Correct and incorrect responses to questions about rate of change on each display method, → by each affinity group. 10 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20 11 JOURNAL OF PRECISON TEACHING AND CELERATION, VOLUME 24, 2008, PAGES 2-20

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.