ebook img

ERIC ED562577: Beyond Individual Differences: Exploring School Effects on SAT® Scores. Research Report No. 2004-3 PDF

2004·0.37 MB·English
by  ERIC
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview ERIC ED562577: Beyond Individual Differences: Exploring School Effects on SAT® Scores. Research Report No. 2004-3

Research Report No. 2004-3 Beyond Individual Differences: Exploring School Effects on SAT® Scores Howard T. Everson and Roger E. Millsap www.collegeboard.com College Board Research Report No. 2004-3 Beyond Individual Differences: Exploring School Effects on SAT® Scores Howard T. Everson and Roger E. Millsap College Entrance Examination Board, New York, 2004 An earlier version of this report was presented as the first author’s presidential address to the American Psychological Association’s Division of Educational Psychology, August 2000, in Washington, D.C. Howard T. Everson is vice president for Academic Initiatives and chief research scientist, the College Board. Roger E. Millsap is professor, Psychology Department, Arizona State University. Researchers are encouraged to freely express their professional judgment. Therefore, points of view or opinions stated in College Board Reports do not necessarily represent official College Board position or policy. The College Board: Connecting Students to College Success The College Board is a not-for-profit membership association whose mission is to connect students to college success and opportunity. Founded in 1900, the association is composed of more than 4,500 schools, colleges, universities, and other educational organizations. Each year, the College Board serves over three million students and their parents, 23,000 high schools, and 3,500 colleges through major programs and services in college admissions, guidance, assessment, financial aid, enrollment, and teaching and learning. Among its best- known programs are the SAT®, the PSAT/NMSQT®, and the Advanced Placement Program® (AP®). The College Board is committed to the principles of excellence and equity, and that commitment is embodied in all of its programs, services, activities, and concerns. For further information, visit www.collegeboard.com. Additional copies of this report (item #040481303) may be obtained from College Board Publications, Box 886, New York, NY 10101-0886, 800 323-7155. The price is $15. Please include $4 for postage and handling. Copyright © 2004 by College Entrance Examination Board. All rights reserved. College Board, Advanced Placement Program, AP, SAT, and the acorn logo are registered trademarks of the College Entrance Examination Board. Connect to college success and SAT Reasoning Test are trademarks owned by the College Entrance Examination Board. PSAT/NMSQT is a registered trademark of the College Entrance Examination Board and National Merit Scholarship Corporation. All other products and services may be trademarks of their respective owners. Visit College Board on the Web: www.collegeboard.com. Printed in the United States of America. Contents 3. High School Achievement Variables . . . . . . . . . 4 4. Extracurricular Activities Variables . . . . . . . . . . 4 5. Family Socioeconomic Background Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 6. Fit Statistics for Competing Structural Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Student-Level Models. . . . . . . . . . . . . . . . . . . . . . 7 Overview of Multilevel Latent-Variable Models. . . . .2 7. Estimated Slope Intercepts from Model 4. . . . . 7 8. Intraclass Correlation (ICC) Estimates Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3 from Multilevel Model . . . . . . . . . . . . . . . . . . . . 11 Student-Level Data . . . . . . . . . . . . . . . . . . . . . . . . .3 9. Estimates for Ethnic/Gender Paths to SAT Scores of the Within-School Model . . . . . . . . . 11 Background Data . . . . . . . . . . . . . . . . . . . . . . . .3 10. Standardized Path Coefficient Estimates for Latent Predictors of SAT for the School-Level Data . . . . . . . . . . . . . . . . . . . . . . . . . .5 Within-School Model . . . . . . . . . . . . . . . . . . . . 12 11. Standardized Factor-Loading Estimates Our Multilevel Modeling Approach. . . . . . . . . . . .5 for the Within-School Model . . . . . . . . . . . . . . 12 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .5 12. Standardized Estimates for Ethnic-by- Gender Paths to Latent Variables for the Stage One: Student-Level Analyses . . . . . . . . . . . .5 Within-School Model. . . . . . . . . . . . . . . . . . . . . 13 13. Standardized Path Coefficient Estimates Model Specification. . . . . . . . . . . . . . . . . . . . . . .5 for the Between-School Model . . . . . . . . . . . . . 13 Parameter Estimates. . . . . . . . . . . . . . . . . . . . . .8 14. Standardized Factor-Loading Estimates for the Between-School Model . . . . . . . . . . . . . 13 Stage Two: School-Level Analyses . . . . . . . . . . . . .9 A1. Means of Measured Variables by Race/Ethnicity: Males. . . . . . . . . . . . . . . . . . . . . 18 Parameter Estimation. . . . . . . . . . . . . . . . . . . .11 A2. Means of Measured Variables by Race/Ethnicity: Females. . . . . . . . . . . . . . . . . . . 18 The Within-School Model. . . . . . . . . . . . . 11 The Between-School Model. . . . . . . . . . . . 13 Figures 1. Measurement model of the three latent Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .14 variables at the student level . . . . . . . . . . . . . . . . 6 2. Structural model of relationship among Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 measures of family background, high school achievement, and extracurricular Conclusion. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 activities on SAT verbal and math scores . . . . . 6 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 3. Path model of effects of family background, high school achievement, and extracurricular activities on SAT Appendix. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18 verbal and math scores. . . . . . . . . . . . . . . . . . . . . 8 Tables 4. Measurement model of school-level effects . . . 9 1. SAT® Verbal and Mathematical Scores by 5. Within-school model . . . . . . . . . . . . . . . . . . . . . . 9 Gender and Race/Ethnicity . . . . . . . . . . . . . . . . . 3 6. Between-school model. . . . . . . . . . . . . . . . . . . . 10 2. Number of Students Responding to the Survey by Gender and Ethnicity. . . . . . . . . . . . . 4 Abstract Many policymakers believe that reducing or eliminating the achievement gap would go a long way toward reducing This paper explores the complex, hierarchical relationship racial inequality in the U.S. (Gordon, 1999). For example, among school characteristics, individual differences in when superintendents of large urban school districts were academic achievement, extracurricular activities, and surveyed recently, they listed the issue of the achievement socioeconomic background on performance on the SAT gap between minority and nonminority students as one of Reasoning Test™ verbal and mathematical sections. Using their major concerns (Huang, Reiser, Parker, Muniec, and multilevel structural equation models (SEMs) with latent Salvucci, 2003). means, data from a national sample of college-bound Educators, however, are perplexed when it comes high school students were analyzed. A nested series of to finding affordable ways of raising African American structural equation models were fit simultaneously to eight and Latino students’ achievement. For decades now, subgroups (disaggregated by both gender and ethnicity) particularly since the publication of the report by James of high school students. Analyses suggest that multilevel Coleman in 1966, Equality of Educational Opportunity, structural equation models provide a reasonably good many policymakers and educators have opined that fit to the data, that family background influences SAT® schools—and in particular our public schools—could do scores directly and indirectly, that learning opportunities little to address the inequities of poverty and the academic in and outside the school curriculum are related to SAT underachievement of minority students. It was a matter of performance, and that the characteristics of the schools underdeveloped academic ability, a view rooted in a strong matter when it comes to performance on the SAT. The belief in individual differences. Schooling had little or no paper’s main contention is that context matters and that effect on standardized test scores. researchers ought to move beyond analyses of individual As a consequence, relatively little effort was directed differences when attempting to understand performance at understanding subgroup differences—either gender or on large-scale standardized tests. race/ethnicity—in standardized test scores throughout the latter half of the last century, and even less attention was given to the complex interaction between gender Introduction and race (Anderson and Bruschi, 1993; Jones, 1987). The projected demographic shifts in the United States in the early decades of the twenty-first century will press In his award winning book, Savage Inequalities: Children educational psychologists and other researchers to better in America’s Schools, Jonathan Kozol writes of being understand the complexities of academic achievement startled by the “remarkable degree of racial segregation and the effects of schooling on standardized test scores. that persisted…and was common in the public schools” In response, we are seeing an increase in research aimed (Kozol, 1991, p. 3). Research on schooling in the United at identifying effective schools (see, for example, Cohen, States adds to the bleak picture painted by Kozol, Raudenbush, and Ball, 2002; Good and Brophy, 1986; indicating that “African Americans, Hispanics, and Hanushek, 1997; Lee, Bryk, and Smith, 1993; and Lee, American Indians—compared with whites, Asians, and 2000). These efforts are beginning to shift the framework Pacific Islanders—are more likely to attend lower-quality for educational research from asking how schools affect schools with fewer material and teacher resources, and learning to understanding how to maximize resources to are more likely to have lower test scores, drop out of promote achievement. What resources matter and what high school, not graduate from college, and attend lower- are the benchmarks of sustainable academic achievement? ranked programs in higher education” (Dabady, 2003, Educators, parents, and policymakers have long assumed p. 1048). It should not be too surprising, therefore, to find that schools and their attendant resources matter when it that minority students, particularly African American comes to student achievement. Yet the past two decades of and Hispanic youngsters, lag behind on standardized test research tell us that the relationship between schooling and scores. Indeed, achievement test scores from the National student achievement is neither direct, nor easily understood. Assessment of Educational Progress (U.S. Department Schools across the United States are organized in different of Education, NCES, 2002), as well as a number of other ways and they use resources differently. The effects of large-scale assessment programs like the SAT, indicate these organizational and structural differences on student that African American and Hispanic students score much achievement are made more complex when we recognize, as lower than their white and Asian American classmates in we must, the variety of individual differences in background reading, mathematics, and science (College Board, 2003; and ability that accompany students in our schools National Center for Education Statistics, 2002). A host of Relying on advances in statistical methods, in other academic achievement indicators show similar gaps particular multilevel covariance analysis or structural (Camara and Schmidt, 1999; Jencks and Phillips, 1998; equation modeling, we set out to explore the complex Mickelson, 2003). relationships and interactions among individual differences 1 in socioeconomic background, gender and ethnicity, and of these difficulties, see Bryk and Raudenbush, 1992; academic ability as they influence performance on a Goldstein, 1987; Lee, 2000; Millsap, 2002; and Snijders high-stakes standardized test, the SAT. By introducing and Bosker, 1999.) To resolve these difficulties, multilevel multilevel models, we attempted to isolate structural models were developed that allowed for the variance differences in secondary schools—i.e., school effects—on attributable to the school level to be partitioned from SAT scores. We used a matched comparison group design the variance associated with individual differences in to develop our evidence. Our research goals were (1) to students, permitting the estimation of more accurate, better understand the correlates of the SAT score gaps less biased, standard errors and more reliable and useful across subgroups of examinees based on gender and race/ information about between- and within-school effects. ethnicity; (2) to develop explanatory models (single and More recently, multilevel structural equation modeling multilevel) that fit reasonably well the SAT background (SEM) has become a popular methodology for studying and performance data across all subgroups; and (3) to complex phenomena in the social and behavioral identify the differential effects of family background, sciences, and is a vigorous line of methodological research academic achievement, and school-level variables on SAT (Raudenbush and Sampson, 1999). Generally speaking, performance. SEMs are simultaneous equation models that permit The remainder of this report is divided into four sections. researchers to build and test models containing both To begin, a brief overview is provided of multilevel latent- endogenous and exogenous latent variables, which, as in variable modeling methods, arguing for their particular factor analysis, can be represented by multiple indicators, advantages when complex, multivariate, multilevel i.e., multiple observed variables. Within this framework, questions are addressed. The second section describes the SEMs combine both measurement and structural models. data analyzed in this study—highlighting when and how The measurement model sets forth the relations (i.e., the data were gathered, and how the data were structured factor loadings) between the latent constructs and their to model latent variables. The third section of the report observed indicators, along with the unique or error variance provides detail on the results of the model fitting at both associated with each observed variable. The structural the single and multilevel stages. The report concludes model, in contrast, describes the directional structural with a discussion of what multilevel models suggest about relationships among the latent variables. Longford (1993) the relative contributions of individual differences in provided maximum likelihood estimation methods for students’ backgrounds, abilities, academic experiences, two-level regression models with latent variables, each with and secondary school characteristics to performance on multiple indicators. Others, e.g., Muthen (1984), Muthen the SAT verbal and mathematical reasoning tests. and Muthen (1998), and McDonald, (1993), extended these estimation methods to address problems of missing data at either level, and made available specialized software for Overview of Multilevel conducting multilevel covariance structure analysis. This family of statistical techniques blends path Latent-Variable analysis and factor analysis in a framework for assessing causal models (Hoyle, 1995; Loehlin, 1992). As such, Models SEM is a very general, largely linear statistical modeling technique, and a powerful tool for representing a network of hypothesized linear relations among a complex set Over the past decade or so, multilevel modeling of variables aggregated or nested at multilevels. This techniques have been developed to investigate school approach is particularly well suited for our study because effects, which, by their nature, are hierarchical (or of the large number of observed variables in our model, multilevel) because students are nested within schools. and our interest in linking performance on the SAT with “School effects,” then, refers to influences of school school characteristics. qualities and characteristics on student achievement, In SEM each measured variable is expressed as a linear e.g., test scores, or other educational outcomes. At the function of one or more common factors, and a single time when Coleman (1966) was studying school effects unique factor. The common factors include influences on and until the mid-1980s, it was common to investigate the measured variables that are shared among two or more school effects using statistical regression methods or such variables. For example, in the set of questions designed analysis of variance. There are obvious conceptual and to measure achievement in high school across a variety of methodological difficulties in using these methods, not academic subjects, we hypothesized that students’ responses the least of which was the troublesome confounding share a single common factor: academic achievement. The of variability at the individual student level with the unique factors include influences that are specific to each variation within and between aggregated levels (e.g., measured variable, such as random measurement error, schools) of the data. (For a more technical discussion or systematic components such as method influences. 2 Both the common and unique factors are denoted as African Americans, Hispanics, Asian Americans, and latent variables in SEM, but interest focuses mainly on whites. Specifically, we asked whether the observed score the common factors, and so references to latent variables differences on the SAT remained after controlling for family often are meant to apply to the common factors. Relations background, course-taking opportunities, and academic among the common factors are, in turn, represented by a achievement. Moreover, we asked if the introduction of set of path equations that capture the hypothesized causal high school characteristics, e.g., size of the school, the paths among these latent variables. The common factors percent minority, its location (urban, rural, suburban), and are intended to represent the constructs of interest, in the percent of students eligible for free or reduced lunch this case academic achievement, extracurricular activities, (an indicator of a school’s socioeconomic status) would socioeconomic background, and SAT scores. contribute to the further reduction of group differences on The equations formally resemble regression equations, SAT scores. with each equation expressing the modeled value of a criterion or endogenous variable as a linear function of one Student-Level Data or more predictor variables, plus a residual or disturbance Our data come from a subset of college-bound seniors term. Unlike ordinary regression equations, however, path who took the SAT during their junior or senior year of model equations are conceived as explicitly causal, with high school, and who graduated from high school in the regression or path coefficients representing the direct 1995. This cohort of 1.14 million students had mean causal influence of the predictor variable on the endogenous SAT verbal and mathematical scores of 504 and 506, variable. In path analysis, predictor variables may respectively, on the recentered SAT scale (Dorans, 2002). themselves be endogenous in relation to other predictors, They represent about 41 percent of all the high school or may serve purely as predictors. Variables of the latter seniors in the United States in 1995. Girls make up about type are denoted as exogenous variables in path analysis, 54 percent of this group, and the cohort is largely white while all other variables are regarded as endogenous. The (69 percent), with 11 percent African American, 8 percent path model represents a causal theory for the endogenous Asian American, 4 percent Mexican American, 4 percent variables. Causal influences on the exogenous variables are other Latinos, 1 percent Native American, and 3 percent not directly modeled. The challenge facing us was to specify who marked “other” when noting their race or ethnicity. a model that expressed how extracurricular academic Because our analyses focus on subgroup differences in activities, family background, and academic achievement SAT scores, Table 1, below, displays the mean SAT verbal all influence performance on the SAT—and determine if and mathematical scores disaggregated by race/ethnicity the hypothesized model represents performance across all and gender for this cohort of college-bound students. subgroups of students—while simultaneously examining The magnitude of group differences in SAT scores the independent effects of the schools’ qualities and is clear. Males outperform females in mathematics, and characteristics on test performance. white and Asian American students, in general, score higher on both the verbal and mathematical SAT tests Method than African American and Hispanic students. Background Data As we noted earlier, this study was animated by a concern When students register with the College Board to sit about the persistent achievement gap between minority and for the SAT, they complete a lengthy questionnaire, nonminority students on large-scale standardized tests like answering 43 questions about their high school courses, the SAT, and by related worries that perhaps standardized tests like the SAT are biased measures of cognitive ability. Always mindful of the policy issues related to school Table 1 improvement, accountability, and the achievement gap, we SAT Verbal and Mathematical Scores by Gender wanted to disentangle the role of individual differences in and Race/Ethnicity academic achievement from other possible contributors— Asian African including, for example, socioeconomic status, educational Whites Americans Americans Hispanics opportunities, and school effects—on test performance. Males Using data collected from the College Board’s Student SAT–M 551 577 444 495 Descriptive Questionnaire, which is administered annually SAT–V 537 533 443 484 to students taking the SAT, we developed and tested a series of multilevel latent-variable models and fit them Females to the SAT data. We attempted to fit our complex model SAT–M 515 543 427 462 to eight groups of students—the male-female subsets of SAT–V 534 531 448 478 3 participation in a sweep of extracurricular activities, Table 2 academic achievement levels (i.e., grades), parental Number of Students Responding to the Survey by education, family income, and their race or ethnicity Gender and Ethnicity (see www.collegeboard.com for a copy of the Student Males Females Total Descriptive Questionnaire). Responses to these questions formed much of the data for this study. For this study, Whites 170,270 212,412 382,682 we excluded students who reported that they were not African 18,411 27,644 46,055 U.S. citizens, and for whom English was not their first Americans language. In addition, we also excluded students who had Asian 12,333 13,732 26,065 not attended a public high school in the United States at Americans the time of testing. We also excluded students who were Hispanics 13,026 16,666 29,692 missing responses on the variables of interest in this Total 214,040 270,454 484,494 study. Table 2 shows the number of students enrolled in public high schools, disaggregated by race/ethnicity and gender, responding to all the relevant questions in the College Board survey, thus comprising our sample. This subset of more than 484,000 students provided the data Table 3 used in the subsequent analyses. High School Achievement Variables We chose carefully and empirically the measured variables to serve as indicators of each hypothesized HSAVG High School Grade Point Average latent construct, recognizing, too, that these self-report CRANK High School Class Rank measures are imperfect and often unreliable. At this ARTGR GPA in Art and Music Courses first level, the student level, we were interested in three SOCGR GPA in Social Science and History latent variables—the family socioeconomic background, ENGR GPA in English Courses academic achievement in high school, and academically related extracurricular activities—and the SAT verbal and LANGR GPA in Foreign Language Courses mathematical scores. MATHGR GPA in Mathematics Courses The College Board questionnaire provided the data SCIGR GPA in Natural Science Courses used to construct the latent variables. For example, students were asked to indicate the total number of years they took high school courses in specific subject areas, and to report their grade point average (GPA) on a scale of A to F for each academic subject. These data elements were used to Table 4 model academic achievement, and they are presented in Table 3, at right. Extracurricular Activities Variables Similarly, students indicated their participation in a ACTCNT Number of Extracurricular Activities (pursued for range of extracurricular activities. Table 4 provides the at least 3 years) complete list of variables we used to model participation APCNT Number of AP® Exams Intended in academic and nonacademic extracurricular activities HNRCNT Number of Honors Classes Taken while in high school. ENGCNT Number of Literature Experiences Students in our sample also reported their best estimates of annual family income in increments of COMPCNT Number of Computer Experiences $10,000, with reporting categories ranging from a low ARTCNT Number of Art, Music, and Theater Experiences of $10,000 to a maximum of $100,000 or more per year. In addition, they reported the highest level of education attained by both parents. These three variables were used to model students’ socioeconomic backgrounds. See Table 5, at right. And finally, in addition to these self-report measures, each student’s SAT verbal and mathematical scores Table 5 (reported on a scale from 200 to 800) were used as the Family Socioeconomic Background Variables outcome measures in our analyses. (The means for all FATHED Father’s Education Level measured variables at the student level, broken down by MOTHED Mother’s Education Level gender and ethnicity, are presented in the Appendix). FAMINC Combined Parental Income 4 School-Level Data analyses using multiple groups defined by ethnicity and gender. Instead, variables were created to represent ethnic Four school-level variables were selected from the 1994 group membership, gender, and the interaction of ethnicity NCES database for public secondary schools in the and gender. These variables were then included in the model United States (National Center for Education Statistics, as exogenous measured variables. This model structure 1994). These variables were limited in number and were permitted an evaluation of ethnic and gender effects on based on preliminary correlations with SAT verbal and SAT performance within a school after adjustment for the mathematical scores. The four selected variables were (1) modeled influences on the SAT. the number of students in the high school eligible for free or reduced lunch (FLE), a measure of the socioeconomic status of the students at that school; (2) the total number Results of students in the high school (MEMBER), an index of the size of the school; (3) the proportion of minority students in the school (PNWHIT); and (4) the location Here we describe the two modeling stages that characterize of the school (LOCALE) on a seven-point scale ranging our study. We also include descriptions of the models from rural to suburban to urban. These school-level tested and present our preliminary conclusions. We begin characteristics were merged with the student-level data with stage one, the student-level model. using common codes on both the College Board and NCES databases, resulting in a database of 8,258 high Stage One: schools and 288,066 students nested within those schools Student-Level Analyses and available for multilevel analysis. At this first level we examined the relationships among Our Multilevel and influences of socioeconomic background, academic achievement, and extracurricular activity levels on high Modeling Approach school students’ verbal and mathematical SAT scores. As we said earlier, our analytic approach relied on We looked at these relationships across eight subgroups multilevel structural equation modeling (SEM). This of students, based on ethnic group membership and approach is particularly well suited for this study because gender. Our explanatory models were developed and of the large number, 19 in all, of observed variables in tested against the SAT verbal and mathematical scores our model, and our interest in linking student-level data, of students in all eight subgroups. We proceeded in as well as school-level data, with performance on the three broad stages: (1) specifying a model that relates the SAT. The structural equation modeling of the combined variables to one another; (2) estimating the parameters student and school data proceeded in two broad stages. of the model; and (3) estimating how well the model fits In the first stage, a nested series of structural equation the empirical data, that is, how well the theoretical model models were fit to the student-level data, omitting replicates the empirical correlations between and among the school-level data. For these single-level analyses, the variables included in our database. multiple student groups were created based on ethnic Specifying the model required us to translate the group membership and gender, yielding eight groups theory we wished to test, in this case the relationship (Asian American males and females, Hispanic males between latent variables and SAT scores, into a particular and females, African American males and females, and structural model that could be derived and tested given white males and females). All models in this stage were the empirical data on hand. Thus, the resulting models, we fit simultaneously in these eight groups, permitting tests hoped, would not be refuted by our data. At the parameter of ethnic and gender invariance of the path coefficients in estimation stage we used the College Board data to derive the structural and measurement models. Mean structures estimates of the model parameters—i.e., the coefficients were included in all models. The use of means permitted calibrating the relationships among the variables—deemed tests for group differences in mean SAT performance optimal by one or more statistical estimation methods. after adjustment for modeled influences on the SAT. And finally, to evaluate the fit of our model, we used the The second stage examined a multilevel structural derived parameter estimates to examine how well the equation model that included influences of the school-level hypothesized model reproduced the covariation found in variables on SAT performance at the school level, using the empirical data. nearly the same student-level structural model as in the first stage. The new feature was that ethnic and gender Model Specification differences on the latent variables were specified as direct This step began with a theory about the relationships among effects using contrast-coded group definitions for the ethnic the variables under study. For convenience, a distinction and gender subgroups. Hence, these were not simultaneous was made between the measurement and structural 5

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.