ebook img

ERIC ED504060: New Measures of English Language Proficiency and Their Relationship to Performance on Large-Scale Content Assessments. Issues & Answers. REL 2009-No. 066 PDF

0.76 MB·English
by  ERIC
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview ERIC ED504060: New Measures of English Language Proficiency and Their Relationship to Performance on Large-Scale Content Assessments. Issues & Answers. REL 2009-No. 066

& ISSUES ANSWERS REL 2009 – No. 066 New measures of English language At Education Development proficiency and Center, Inc. their relationship to performance on large-scale content assessments U.S. D e p a r t m e n t o f E d u c a t i o n & ISSUES ANSWERS REL 2009–No. 066 At Education Development Center, Inc. New measures of English language proficiency and their relationship to performance on large-scale content assessments January 2009 Prepared by Caroline E. Parker Education Development Center, Inc. Josephine Louie Education Development Center, Inc. Laura O’Dwyer Boston College U.S. D e p a r t m e n t o f E d u c a t i o n WA ME MT ND VT OR MN NH ID SD WI NY MA WY MI CTRI IA PA NE NV IL IN OH CA UT CCOO WV VA KS MO KY NC TN AZ OK NM AR SC MS AL GA LA TX AK FL VI At Education Development PR Center, Inc. Issues & Answers is an ongoing series of reports from short-term Fast Response Projects conducted by the regional educa- tional laboratories on current education issues of importance at local, state, and regional levels. Fast Response Project topics change to reflect new issues, as identified through lab outreach and requests for assistance from policymakers and educa- tors at state and local levels and from communities, businesses, parents, families, and youth. All Issues & Answers reports meet Institute of Education Sciences standards for scientifically valid research. January 2009 This report was prepared for the Institute of Education Sciences (IES) under Contract ED-06-CO-0025 by Regional Educa- tional Laboratory Northeast and Islands administered by Education Development Center, Inc. The content of the publica- tion does not necessarily reflect the views or policies of IES or the U.S. Department of Education nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government. This report is in the public domain. While permission to reprint this publication is not necessary, it should be cited as: Parker, C. E., Louie, J., and O’Dwyer, L. (2009). New measures of English language proficiency and their relationship to per- formance on large-scale content assessments (Issues & Answers Report, REL 2009–No. 066). Washington, DC: U.S. Depart- ment of Education, Institute of Education Sciences, National Center for Education Evaluation and Regional Assistance, Regional Educational Laboratory Northeast and Islands. Retrieved from http://ies.ed.gov/ncee/edlabs. This report is available on the regional educational laboratory web site at http://ies.ed.gov/ncee/edlabs. Summary REL 2009–No. 066 New measures of English language proficiency and their relationship to performance on large-scale content assessments Using assessment results for 5th and 8th along with other traditionally underserved grade English language learner students student subgroups, to proficiency on statewide in three Northeast and Island Region assessments. states, the report finds that the English language domains of reading and writ- In response to a request from New Hampshire, ing (as measured by a proficiency as- Rhode Island, and Vermont to explore how sessment) are significant predictors of English language proficiency measures may be performance on reading, writing, and related to performance outcomes on content mathematics assessments and that the assessments, this report uses the results of two domains of reading and writing (literacy new large-scale assessments—the Assessing skills) are more closely associated with Comprehension and Communication in Eng- performance than are the English lan- lish State-to-State for English Language Learn- guage domains of speaking and listening ers (ACCESS for ELLs) English proficiency (oral skills). assessment and the New England Common Assessment Program (NECAP)—to address As the English language learner population the following research question: grows throughout the Northeast and Islands Region, state departments of education are How does performance in four language seeking assistance in creating comprehen- domains on an English language profi- sive approaches to meeting English language ciency assessment predict English lan- learner students’ academic needs in both guage learner students’ performance on a instruction and assessment. Driving educa- state content assessment after accounting tor concerns is the fact that English language for student and school characteristics? learner students consistently score lower on state assessments than students for whom Based on findings from previous research, English is their first language. In the context of this report hypothesized that after controlling the No Child Left Behind Act of 2001 (NCLB), for individual student characteristics such states are seeking information to inform as gender, poverty status, disability status, their efforts to reduce achievement gaps and race/ethnicity, age for grade, and years in to bring English language learner students, English language learner programs as well as ii Summary for school characteristics such as school size, 23 percent in 8th grade after controlling school poverty, racial composition, English for student and school covariates. language learner student density, and geogra- phy, measures of academic English language • NECAP writing scores in 5th grade were proficiency would predict English language significantly and positively predicted by learner student outcomes on state content as- ACCESS reading and writing scores and sessments. The report also hypothesized that in 8th grade by all four ACCESS domain measures of English language literacy (reading scores after controlling for other ACCESS and writing) would be stronger predictors of scores and student and school character- content assessment outcomes than would mea- istics. ACCESS reading and writing scores sures of English oral proficiency (listening and were the strongest predictors of NECAP speaking).1 writing outcomes in 5th and 8th grades. ACCESS domain scores explained 28 per- To test these hypotheses, multilevel regression cent of the variance in NECAP writing models were fit to assessment score data for scores in 5th grade and 25 percent in 8th 5th and 8th grade English language learner grade after controlling for other covariates. students in New Hampshire, Rhode Island, and Vermont. After controlling for student and • Like NECAP reading and writing scores, school characteristics, English language profi- NECAP mathematics scores in both 5th ciency scores (as measured by ACCESS) were and 8th grades were positively and sig- indeed significant predictors of content assess- nificantly predicted by ACCESS reading ment outcomes (as measured by the NECAP). and writing scores after controlling for The models also showed that after accounting other ACCESS scores and student and for other covariates, ACCESS measures of Eng- school characteristics. Among the ACCESS lish literacy were significantly stronger predic- domain scores ACCESS reading scores tors of NECAP outcomes than were ACCESS were the strongest predictor of NECAP measures of oral proficiency. Specifically, this mathematics outcomes for both 5th and report finds that: 8th grade English language learner stu- dents, followed by ACCESS writing scores. • NECAP reading scores in both 5th and 8th ACCESS domain scores explained 21 grades were significantly and positively percent of the variance in NECAP math- predicted by ACCESS reading, writing, ematics scores in 5th grade and 14 percent and speaking scores after controlling for in 8th grade. other ACCESS scores and student and school characteristics. Among the ACCESS • ACCESS reading and writing scores were domain scores the strongest predictor of significant predictors of NECAP reading, NECAP reading outcomes was ACCESS writing, and mathematics scores in 5th reading scores, followed by ACCESS writ- and 8th grades. ACCESS speaking and ing and speaking scores. ACCESS domain listening scores were significant predictors scores explained 30 percent of the variance of NECAP scores for only four outcomes: in NECAP reading scores in 5th grade and 5th and 8th grade reading (speaking), 8th Summary iii grade writing (speaking and listening), In 5th and 8th grades, ACCESS scores ex- and 5th grade mathematics (listening). plained 14–30 percent of the variance in scores for all three NECAP content scores (reading, In sum, ACCESS measures of English literacy writing, and mathematics) after controlling skills (reading and writing scores) were signifi- for background student and school charac- cant predictors of NECAP reading and writ- teristics. The ACCESS scores explained more ing outcomes in 5th and 8th grades. Notably, of the variance in 5th grade (from 21 percent ACCESS reading and writing scores were also of NECAP mathematics scores to 30 percent positive and significant predictors of NECAP of NECAP reading scores) than in 8th grade mathematics scores. In addition, except for 8th (from 14 percent of NECAP mathematics grade writing, ACCESS reading and writing scores to 25 percent of NECAP writing scores). scores were significantly stronger predictors of NECAP outcomes than were ACCESS listening January 2009 and speaking scores. This evidence supports the original hypothesis that ACCESS measures Note of English literacy skills are better predictors of NECAP content outcomes than are ACCESS 1. In this report “stronger” predictors are de- fined as those whose regression coefficients are measures of English oral skills (listening and larger than those of other noted predictors in speaking). Readers are cautioned, however, the study’s regression models. A predictor is that the analyses and interpretations presented “significantly stronger” than another predic- tor when the difference between the regression are correlational and therefore do not allow coefficients is greater than zero at the p < 0.05 causal conclusions. level. iv TablE of coNTENTs Why this study? 1 Regional need 1 Research question and conceptual framework 3 How does performance in four language domains on an English language proficiency assessment predict English language learner students’ performance on a state content assessment after accounting for student and school characteristics? 5 Predictors of NECAP outcomes in reading, writing, and mathematics 8 Predicted changes across NECAP outcomes for each ACCESS domain 11 Discussion, future research, and study limitations 12 Additional observations and topics for future research 12 Study limitations 13 Appendix A Review of the literature 15 Appendix B Methods of analysis 18 Appendix C About the data 20 Appendix D Descriptions and reliability estimates for New England Common Assessment Program and Assessing Comprehension and Communication in English State-to-State 28 Appendix E Confidence intervals for testing differences 31 Appendix F Multilevel modeling procedures 33 Appendix G New England Common Assessment Program models 35 Notes 47 References 49 Boxes 1 Definitions of key terms 2 2 Methodology 4 Figure 1 Conceptual framework: acquiring language of instruction and demonstrating knowledge, skills, and abilities on content assessment 3 Tables 1 NECAP scores regressed on different student ACCESS scores and student and school characteristics, 2006 6 2 Percent of additional and total variance in 5th and 8th grade NECAP scores explained by the three models, 2006 8 C1 NECAP data for English language learner students with 4th grade ACCESS data, 2006 21 C2 NECAP data for English language learner students with 7th grade ACCESS data, 2006 21 Table of conTenTS v C3 5th and 8th grade English language learner dataset, before and after imputation and deletion of cases with missing data, 2006 22 C4 Characteristics of English language learner students from New Hampshire, Rhode Island, and Vermont in the 5th and 8th grade samples, 2006 24 C5 Model variables and their scales 25 C6 Summary statistics of continuous variables used in models, by grade, 2006 27 D1 Reliability estimates for ACCESS subscale scores 29 D2 Population reliability estimates for NECAP outcome measures 29 D3 English language learner student subgroup reliability estimates for NECAP outcome measures 29 E1 0.95 confidence interval around regression coefficients, by grade level and NECAP content area (within models), 2006 31 E2 0.95 confidence interval around regression coefficients, by content areas (across 5th and 8th grade models), 2006 32 G1 Predictors of 5th grade NECAP reading scores, 2006 35 G2 Predictors of 5th grade NECAP writing scores, 2006 37 G3 Predictors of 5th grade NECAP mathematics scores, 2006 39 G4 Predictors of 8th grade NECAP reading scores, 2006 41 G5 Predictors of 8th grade NECAP writing scores, 2006 43 G6 Predictors of 8th grade NECAP mathematics scores, 2006 44 Why ThiS STudy? 1 Why ThIs sTUdy? Using assessment results for 5th and 8th As the English language learner population grows throughout the Northeast and Islands Region, grade English language and as achievement gaps persist between English language learner students and native English learner students in speakers, state education agencies are creating comprehensive programs to meet English language three Northeast and learner student needs. With more than one in five Island Region states, school-age children in Rhode Island speaking a language other than English at home (Kids Count the report finds that Data Center 2006), the Rhode Island Department of Education and the Governor’s PK–16 Council the English language have identified educating English language learner students as a priority. And in New Hampshire and domains of reading and Vermont, where English language learner popula- writing (as measured tions are smaller and more isolated, state education agencies are looking for efficient ways to meet these by a proficiency students’ needs. New Hampshire, for example, has recently requested assistance from regional educa- assessment) are tion support centers to define and monitor services for English language learner students. significant predictors Regional need of performance on reading, writing, In the context of the No Child Left Behind Act of 2001 (NCLB), Northeast and Islands Region states and mathematics want technical assistance and targeted data analysis to inform their efforts to reduce achievement gaps assessments and and to bring English language learner students, that the domains of along with members of other traditionally under- served student subgroups, to proficiency on state- reading and writing wide assessments. English language learner students consistently score lower on state assessments than (literacy skills) are native English speakers, often by as many as 20–30 percentage points (Abedi and Dietel 2004). The more closely associated reasons for such low performance are varied and with performance complex, not least of which is that English language learner students are learning content (mathematics, than are the English science, reading, and writing) and are being assessed in these content areas while they are learning the language domains academic English that is the medium for classroom learning (see box 1 for a definition of key terms). of speaking and listening (oral skills). To better understand the learning needs of English language learner students, New Hampshire, Rhode Island, and Vermont have been administering a new 2 engliSh language proficiency and performance on large-Scale conTenT aSSeSSmenTS box 1 and extract information, and fol- interitem correlations. Values range Definitions of key terms low the instructional discourse from 0 to 1. Estimates of 0.7 or higher through which teachers provide indicate optimal reliability. Academic English. Researchers dis- information. Scale score. A scale score is a test score tinguish between social English and • Writing. The ability to produce that has been converted from a raw the academic English needed to learn written text with content and score (such as a number correct) to a academic content. Academic language format fulfilling classroom number on a common scale indicat- uses different vocabularies, types assignments at age- and grade- ing a student’s performance. NECAP of syntax, and levels of classroom appropriate levels. scale scores range from 500 to 580 for discourse and involves abstract forms grade 5 and from 800 to 880 for grade of language needed to communicate in • Speaking. The ability to use oral 8 in all content areas. ACCESS scale formal, often decontextualized, situa- language appropriately and scores range from 100 to 600. tions and may be needed for successful effectively in learning activities navigation of classroom learning and (such as peer tutoring, collab- Standard deviation. Standard devia- large-scale assessments. (For more de- orative learning activities, and tion is a measure of how widely or tail on the literature, see appendix A.) question and answer sessions) narrowly data are dispersed around within the classroom and in the mean for the distribution. For ex- English language learner student. Al- social interactions within the ample, the standard deviation of a set though definitions vary, the Council school (Council of Chief State of student test scores is calculated by of Chief State School Officers defines School Officers 1992). summing the squared deviations of an English language learner student each student’s individual score from as a student with a language back- Multilevel regression modeling. A set of the mean, dividing this sum by one ground other than English and whose regression-based procedures used to minus the total number of students, proficiency in English is such that the analyze data with a nested or hier- and taking the square root of the re- probability of the student’s academic archical structure (such as students sulting number. A student’s test score success in an English-only classroom nested within schools). When used can be described in terms of standard is below that of an academically suc- with nested data, multilevel regres- deviation units by subtracting the cessful peer with an English language sion modeling allows correct standard mean from the student’s score and background (Council of Chief State errors to be calculated, allows the dividing that figure by the standard School Officers 1992). relationship between the independent deviation. and dependent variables to vary across English language proficiency. Al- groups, and allows individual and Standard error. Standard error is though definitions vary, the Council group characteristics to be included in a measure of the amount of error of Chief State School Officers defines models for predicting individual out- between an estimated statistic from a fully English proficient student as a comes (Raudenbush and Bryk 2002). a sample and the true statistic for the student who is able to use English to population. For example, the mean ask questions, to understand teachers Reliability estimate. Reliability is the test score for a sample of students will and reading materials, to test ideas, consistency of measurement. A reli- have a standard error that estimates and to challenge what is being asked ability estimate is a number calcu- the deviation between the sample in the classroom. Four language skills lated to represent the consistency of mean and the mean for the entire contribute to proficiency: scores provided by a measurement in- student population. The standard strument. The reliability estimates re- • Reading. The ability to compre- error for a sample mean is calculated ferred to here are internal consistency hend and interpret text at the age by dividing the standard deviation of estimates of reliability (Cronbach’s α). and grade-appropriate level. the sample data by the square root of Calculating internal consistency the number of subjects in the sample. • Listening. The ability to under- reliability estimates requires only stand the language of the teacher one administration of the measure- Variance. Variance is the standard and instruction, comprehend ment tool and is calculated from the deviation squared.

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.