Education Drivers

Value Added

Value-added modeling (VAM) is a statistical approach that provides quantitative performance measures for monitoring and evaluating schools and other parts of the education system. VAM comprises a collection of complex statistical techniques that use standardized test scores to estimate the effects of individual schools or teachers on student performance. Although the VAM approach holds promise, serious technical issues have been raised regarding VAM as a high-stakes instrument in accountability initiatives. The key question remains: Can VAM estimates derived from standardized test scores serve as a proxy for teaching quality? To date, research on the efficacy of VAM is mixed. A body of research supports VAM, but other studies suggest that model estimates are unstable over time and subject to bias and imprecision. A second issue with VAM is the sole use of standardized tests as the measure of student performance. Despite these valid concerns, VAM has proved valuable in performance improvement efforts when used cautiously in combination with other measures of student performance such as end-of-course tests, final grades, and structured classroom observations.

Value-Added Research in Education: Reliability, Validity, Efficacy, and Usefulness

(Wing Institute Original Paper)


Cleaver, S., Detrich, R., & States, J. (2020). Overview of Value-Added Research in Education: Reliability, Validity, Efficacy, and Usefulness. Oakland, CA: The Wing Institute. https://www.winginstitute.org/staff-value-added.

  

One goal of education is to produce students who make measurable academic progress each year. National policy (e.g., No Child Left Behind) has been built around the idea that schools consistently produce students who demonstrate increasing mastery of content. One goal of education research is to understand what educational factors (e.g., teachers, students, class size) contribute to student learning. When it comes to identifying teacher effectiveness, the core question is: How accurately do test scores reflect a teacher’s contribution to student learning?

The question of how teachers contribute to student achievement is crucial because teachers are the most important school-based factor in student achievement (Rivkin, Hanushek, & Kain, 2005; Sanders & Horn, 1998). Standardized test scores are related to how far students progress in school and how much they earn later in life (Chetty et al., 2011; Hanushek & Woessmann, 2008; Lazear, 2003; Murnane, Willett, Duhaldeborde, & Tyler, 2000), so it is worth considering these types of data when looking at teacher impact. However, it has been difficult to distinguish between effective and less effective teachers in relation to raising student achievement (Toch & Rothman, 2008; Weisberg et al., 2009).

Currently, information on how teachers affect individual student results is used in teacher evaluation programs at the local and system levels. A teacher’s impact on student achievement may influence teacher retention, pay, and incentives. The problem is how to capture the growth that an individual teacher produces in a student’s performance apart from other factors, such as socioeconomic status or class size, that could also affect those gains. Value-added modeling (also known as value-added measurement or value-added assessment) is an attempt to address that concern. Value-added modeling is a method of evaluation that attempts to measure a teacher’s contribution to student achievement in a given year by isolating the value added, or contribution, of the teacher and comparing it with the value added by other teachers. These models typically analyze data from multiple years and compare results across teachers. The purpose of this paper on value-added research in education is to define this type of research, provide an overview of how it has been conducted, and discuss its benefits and limitations.

 

Ways to Measure Student Achievement

There are various ways to evaluate student achievement. Status models use the test scores of one group of students at one point in time, comparing the group’s achievement on an exam with the results expected on that assessment (Koretz, 2008). Cohort-to-cohort change models use changes in achievement exam results over time; for example, comparing the percentage of students who are proficient on an exam in one year with previous proficiency rates to see whether there has been improvement (Koretz, 2008). In contrast, value-added models are based on individual student growth across a year of education. In this way, value-added models attempt to demonstrate student achievement, and thereby teacher effectiveness, by using information from the same group of students across time, something that the status and cohort-to-cohort change models cannot do (Koretz, 2008).

Value-added research in education uses statistical models that control for, or remove the influence of, some variables in order to isolate the effect of a teacher on his or her students’ learning (Steele, Hamilton, & Stecher, 2010). This statistical modeling takes students’ test scores and, sometimes, school characteristics to create value-added scores for teachers and schools (Braun, 2015).

The purpose of value-added modeling is to show relative effectiveness in improving student scores. Value-added measures are also generally regarded as more informative than indicators based on a single snapshot of student performance (e.g., the proportion of students proficient in a subject).

 

Value-Added Measures in Educational Research

Value-added measures use complicated formulas that take into account multiple factors (e.g., past and current test scores) to show how effective teachers are at producing student growth while statistically holding other factors (e.g., how students are assigned to classrooms) constant (David, 2010; McCaffrey & Lockwood, 2011).
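To make this logic concrete, the following minimal sketch (in Python) shows the simplest version of the idea: predict each student's current score from the prior year's score, and treat the average unexplained gain of a teacher's students as that teacher's value added. The student records, teacher labels, and the single-predictor model are invented for illustration; operational systems such as TVAAS use far more elaborate models, multiple years of data, and additional adjustments.

import numpy as np

# Hypothetical records: (teacher_id, prior_year_score, current_score)
records = [
    ("T1", 410, 452), ("T1", 500, 540), ("T1", 380, 430),
    ("T2", 420, 440), ("T2", 510, 525), ("T2", 395, 410),
]
prior = np.array([r[1] for r in records], dtype=float)
current = np.array([r[2] for r in records], dtype=float)

# 1. Fit a least-squares line predicting current scores from prior scores.
slope, intercept = np.polyfit(prior, current, deg=1)
predicted = slope * prior + intercept

# 2. Residuals: the portion of each student's growth that the prior score does not explain.
residuals = current - predicted

# 3. A teacher's estimated "value added" is the mean residual of his or her students.
for teacher in sorted({r[0] for r in records}):
    mask = np.array([r[0] == teacher for r in records])
    print(f"{teacher}: estimated value added = {residuals[mask].mean():+.1f} points")

Real models typically add controls for student and classroom characteristics and statistical corrections for small or incomplete samples; the sketch is meant only to show how value added is defined as growth beyond what prior achievement predicts.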

When researchers measure a teacher’s ability to produce future achievement in students, they find large differences between teachers. Typical teacher quality variables (e.g., education level, licensure, experience) explain little of the variation in the results that teachers produce with students (Hanushek & Rivkin, 2010). Because observable teacher characteristics explain so little, other approaches are needed to determine teacher quality.

Koretz (2008) has argued that, in addition to looking at how much students learn, value-added measures should take into account students’ rate of learning, or how quickly they master new skills or learn new information. Students advance through school with various student characteristics (e.g., disability, initial aptitude) and external factors (e.g., parental income) that impact their rate of learning. For example, students who know more about a topic learn new information at a faster rate than those who don’t know as much about the topic. Value-added measures, including models that take multiple years of historical student data into account, can use that information to determine a teacher’s effectiveness.

 

History of Value-Added Measures in Education

Because teacher evaluation efforts tend to focus on rewarding teachers for their contribution to student learning, it is necessary to build evaluation systems that measure student performance (Steele et al., 2010). These efforts, combined with competitive federal programs (e.g., Race to the Top) and philanthropic efforts (e.g., the Bill and Melinda Gates Foundation’s Empowering Effective Teachers [EET] initiative), are shaping how states and districts recruit, evaluate, reward, and develop teachers (Steele et al., 2010). One notable example of value-added measures is the Tennessee Value-Added Assessment System (TVAAS).

 

Tennessee Value-Added Assessment System (TVAAS)

In Tennessee, education reform started with the 1984 Comprehensive Education Reform Act, which, among other efforts, provided merit pay. At the same time, two statisticians at the University of Tennessee were working to gauge the feasibility of a statistical model that would eliminate the impediments (e.g., missing student records, different modes of teaching, teacher turnover) to using student achievement data to understand education outcomes of students in grades 2 through 5 in Knox County, Tennessee. The findings indicated the following:

  • There were measurable differences among schools and teachers in regard to student learning.
  • The effects of school and teachers on student achievement were consistent from year to year.
  • Teacher effects were not school specific, meaning that a gain could not be predicted based on the location of the school.
  • There was a strong positive correlation between the teacher’s effect on student achievement and the teacher’s principal evaluation, even though the evaluations were more subjective.
  • Student gains were not related to the initial ability or achievement levels of the students at the start of the school year (McLean & Sanders, 1984).

Subsequent research on TVAAS indicated that academic gains were unrelated to socioeconomic status (i.e., eligibility for free and reduced-price lunch), race or ethnicity, or the mean achievement level of the school (Sanders & Horn, 1998).

The focus on growth (i.e., using students’ information across years and creating a model that made the students their own comparison group) gave researchers the ability to see how students grew over time and how much of the growth was produced by teachers (Sanders & Horn, 1998). This study, together with other studies that replicated the findings, indicated that statistical models could be used to isolate the impact of teachers on student progress.

 

Challenges of Value-Added Research

Challenges in value-added research relate to trustworthiness, reliability, validity, and usefulness. These challenges also apply to other measures of student progress (Betebenner, 2009). However, when value-added models are used to make high-stakes decisions, such as employment and pay incentives, it is important to have data that teachers and school leaders can trust and use effectively.

 

Trustworthiness Considerations

Research is deemed trustworthy when it demonstrates value and allows for external judgments about procedures and findings, specifically that they are objective and unbiased.

Bias is a concern in value-added modeling. For example, Rothstein (2008) raised the concern that estimates based on test score gains are biased because students are not randomly assigned to teachers. Comparing teachers who have large numbers of students with behavior or learning concerns with teachers whose classrooms contain higher achieving students is problematic because the scores favor the teachers with higher achieving students.

There is also a concern that the choice of test can affect a teacher’s value-added score (McCaffrey, 2012). In analyzing value-added ratings of middle school teachers using two different math subtests, RAND researchers found discrepancies in teachers’ effectiveness depending on the subtest used (Lockwood et al., 2006). If score results are a function of the test used and not teacher behavior, this is a significant concern. It is important to choose tests that directly measure what school leaders want to assess, and to recognize the ways in which assessments are imperfect and how those imperfections affect the interpretation of value-added measures (Lockwood et al., 2006; McCaffrey, 2012).

Stability over time is also an issue that impacts trustworthiness. In considering whether value-added analyses identify the same teachers as effective every year, Goldhaber and Hansen (2008) examined a large data set from North Carolina and found that estimates of teacher effectiveness in reading and math were not the same across years. Similarly, other researchers have questioned whether it is possible to compare gains from one year to the next using tests that may not include the same content (Koretz, 2008). This suggests that effectiveness is not a fixed quality and can vary over time depending, for example, on the makeup of a teacher’s class or the teacher’s years of experience.

 

Reliability Considerations  

Reliability is the extent to which scores are consistent across repeated measures and are free of measurement errors (AERA, APA, & NCME, 1999). Put another way, reliability lies in the consistency of tests.

Internal reliability (or internal consistency) is the extent to which test items measure the same construct, that is, the concept or topic being investigated (Crocker & Algina, 1986). For example, tests might measure student mastery of math concepts or progress in reading. Internal reliability is also the degree to which similar results occur under consistent testing conditions. When tests have internal reliability, they are accurate and consistent from one testing session to the next. When expressed quantitatively, reliability scores above 0.8 are considered acceptable, and scores above 0.9 quite reliable. The higher the reliability score the better, although a minimum reliability score of 0.5 or 0.6 might be acceptable for some assessments. In their research, Steele et al. (2010) identified three reliability considerations related to value-added modeling (a brief illustration of computing such an internal-consistency coefficient follows the list):

  • Internal consistency of student assessment scores
  • Consistency of ratings by individuals scoring assessments
  • Consistency of the estimates of the value-added measures generated from student scores.
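To illustrate the first of these considerations, the short sketch below computes Cronbach's alpha, a widely used internal-consistency coefficient, for a small matrix of hypothetical item scores; the 0.8 and 0.9 benchmarks mentioned above are commonly applied to coefficients of this kind. The data and the helper function are invented for illustration and are not drawn from any cited study.

import numpy as np

def cronbach_alpha(item_scores):
    # item_scores: one row per student, one column per test item
    k = item_scores.shape[1]                          # number of items
    item_vars = item_scores.var(axis=0, ddof=1)       # variance of each item
    total_var = item_scores.sum(axis=1).var(ddof=1)   # variance of students' total scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# Hypothetical scores for five students on a four-item quiz (0-4 points per item)
scores = np.array([
    [4, 3, 4, 4],
    [2, 2, 1, 2],
    [3, 3, 3, 2],
    [4, 4, 3, 4],
    [1, 2, 2, 1],
])

print(f"Cronbach's alpha = {cronbach_alpha(scores):.2f}")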

As educational research develops, larger longitudinal data sets become available to work with. These data have been used to evaluate the effects of educational inputs on student achievement. One concern is that missing data may distort the results of value-added models. However, in one analysis of a large data set from an urban U.S. school district, including records with missing data (e.g., a missing year of student test scores) had little impact on the estimated teacher effects (McCaffrey & Lockwood, 2011).

 

Validity Considerations

Validity is how well the evidence from a test supports the interpretation of test scores and, in turn, the use of the test (AERA, APA, & NCME, 1999). In other words, it refers to the accuracy of an inference drawn from the results of a test or how well the assessment aligns with course content or what students are learning.

Concern about validity addresses the core questions of value-added modeling: To what extent do changes in students’ performance on an assessment reflect their actual understanding of the content? How much do test scores accurately reflect the teacher’s contribution to student learning? Various aspects of instruction (e.g., teaching test-taking strategies) may contribute to changes in performance on an assessment more than growth in content knowledge (e.g., Koretz, 2008; Koretz & Barron, 1998). This would artificially inflate student scores and distort teacher value-added scores.

Inconsistencies in the content of a test could impact the validity of an inference about student growth (McCaffrey, Lockwood, Koretz, & Hamilton, 2003). For example, gauging students’ growth in science knowledge overall by using tests in biology and then chemistry might lead to an invalid inference if the students have different levels of knowledge in the two sciences. This concern can also arise when students take tests in the same course, for example, two math tests focusing on different standards (Martineau, 2006).

Another threat to the validity of value-added modeling lies in attributing student performance to an individual teacher when the assessment covers material from multiple courses, for example, SAT or ACT tests that are cumulative across high school.

 

Usefulness Considerations

Usefulness relates to how value-added measures are received and used by teachers and school leaders. Principals have expressed skepticism about the usefulness of value-added measures compared with observational data, specifically regarding their timeliness, validity, and utility for teachers (Goldring et al., 2015).

Teachers have also expressed concern about the lack of transparency of value-added scores (Jiang, Sporte, & Luppescu, 2015). In a study of Chicago Public Schools’ Recognizing Educators Advancing Chicago (REACH) program, which used both test scores and teacher observations, teachers—especially special education teachers—expressed concern about the reliance on test scores, which is a concern of value-added and other models (Jiang et al., 2015).

While teacher evaluation methods have their fair share of concerns (Cleaver, Detrich, & States, 2018), the push for more objective evaluation systems has increased the usefulness of value-added measures. Value-added measures are one way to measure teacher effectiveness, and therefore may be useful for understanding teacher impacts over time. There are moderate correlations between value-added measures and other measures of teacher effectiveness, such as principal evaluations. In one study, the correlation between value-added measures and principal surveys was 0.32 (Jacob & Lefgren, 2008). In another study, the correlation between value-added measures and principal assessments was 0.41 for teachers’ math scores and 0.44 for teachers’ reading scores (Harris & Sass, 2014). In their analysis of 201 teachers across grades 2 through 6, Jacob and Lefgren (2008) found that value-added models did a slightly better job of predicting future test scores, but both observation and value-added measures were able to predict which teachers would be in the top and bottom 20% the following year. These findings were replicated by studies of other evaluation systems across the country, for example, in Cincinnati and in Washoe County, Nevada (Milanowski, Kimball, & White, 2004).

 

Lingering Questions About Value-Added Modeling

No model answers all questions. In value-added modeling, the questions that remain include the following:

  • What does it mean to be a good teacher and how is that measured?
  • What outcomes do we want from education? For example, are there outcomes that cannot be measured on a standardized assessment but are just as important as standardized test outcomes (Ruzek, Domina, Conley, Duncan, & Karabenick, 2015)?
  • Do good teachers succeed with all types of students or only certain types (Condie, Lefgren, & Sims, 2014)?
  • Is it meaningful to compare gains across grades, for example, between grade 3 and grade 6, even if the general content is the same (Everson, 2017)?

There are also limitations to how value-added modeling demonstrates the impact of teachers on students who are learning English or who have disabilities. Specifically, an analysis by the Carnegie Foundation found that value-added results for teachers of inclusive classrooms change only slightly whether or not students with disabilities are included (McCaffrey & Buzick, 2014). However, the analysis also identified the following concerns with applying value-added modeling to teachers who exclusively teach students with disabilities:

  • The scores of students with disabilities can be low, which may lead models to attribute those lower scores to their teachers’ performance.
  • Lower student test scores can also increase the random errors in value-added models.
  • The use of testing accommodations from year to year can create variability in growth that may be attributed to teachers incorrectly.
  • States and districts that use value-added models may find it helpful to monitor the proportion of students with disabilities in classes along with other evidence of systematic error so they can revise the models as needed (McCaffrey & Buzick, 2014).

 

Recommendations for Value-Added Measures

The use of value-added measures is relatively new in education research. Because these measures are being used to make decisions that impact teachers, decision makers should carefully consider the following factors (Martineau, 2006):

  • Reliability of assessment measures used (AERA, APA, & NCME, 1999),
  • Validity of inferences drawn from value-added measures,
  • How well student assessments compare from year to year.

While value-added measures provide some useful information about differences in teacher performance, individual scores suffer from variance and low stability as well as bias of undetermined size (Braun, 2015). The American Statistical Association (2014) has recommended caution in using value-added measures for evaluation or other high-stakes purposes. Because high-quality evaluations based on observation of teacher practice can provide information about teacher effectiveness, effort should be put into training teachers and principals in teacher evaluation (Braun, 2015). Both teacher observation and value-added measures provide usable information about teacher quality (Milanowski, Kimball, & White, 2004). However, a remaining question is whether high-quality, large-scale observation protocols can be achieved and maintained as a complement to value-added measures (Jiang et al., 2015).

 

Conclusion

Value-added models have expanded our ability to analyze teachers’ impact on student achievement. From value-added research, we know that there is variation in teacher performance (Aaronson, Barrow, & Sander, 2007; Rivkin et al., 2005) and that value-added models can capture the effect of teachers on student achievement (Hanushek & Rivkin, 2010; McCaffrey & Buzick, 2014; Sanders & Horn, 1998). There are concerns related to reliability, validity, efficacy, and usefulness that should be taken into account before designing and implementing a plan for evaluating teachers using value-added measures across a district. In addition, questions remain about the precision and practicality of value-added measures in schools, specifically whether value-added measures can address core questions about quality teaching, and the practicality of value-added modeling compared with observation protocols.

 

Citations

Aaronson, D., Barrow, L., & Sander, W. (2007). Teachers and student achievement in the Chicago Public High Schools. Journal of Labor Economics, 25(1), 95–135.

American Educational Research Association (AERA), American Psychological Association (APA), and National Council on Measurement in Education (NCME). (1999). The standards for educational and psychological testing. Washington, DC: AERA Publications.

American Statistical Association. (2014). ASA statement on using value-added models for educational assessments. Alexandria, VA: Author. Retrieved from https://www.amstat.org/asa/files/pdfs/POL-ASAVAM-Statement.pdf

Betebenner, D. (2009). Growth, standards and accountability. Dover, NH: National Center for the Improvement of Educational Assessment. Retrieved from https://www.nciea.org/sites/default/files/publications/growthandStandard_DB09.pdf

Braun, H. (2015). The value in value added depends on the ecology. Educational Researcher, 44(2), 127–131. Retrieved from https://doi.org/10.3102%2F0013189X15576341

Chetty, R., Friedman, J., Hilger, N., Saez, E., Schanzenbach, D. W., & Yagan, D. (2011). How does your kindergarten classroom affect your earnings? Evidence from Project STAR. Quarterly Journal of Economics, 126(4), 1593–1660.

Cleaver, S., Detrich, R., & States, J. (2018). Overview of teacher formal evaluation. Oakland, CA: The Wing Institute. Retrieved from https://www.winginstitute.org/teacher-evaluation-formal

Condie, S., Lefgren, L., & Sims, D. (2014). Teacher heterogeneity, value-added and education policy. Economics of Education Review, 40, 76–92.

Crocker, L. M., & Algina, J. (1986). Introduction to classical and modern test theory. New York: Holt, Rinehart and Winston.

David, J. L. (2010). What research says about using value-added measures to evaluate teachers. Educational Leadership, 67(8), 81–82.

Everson, K. C. (2017). Value-added modeling and educational accountability: Are we answering the real questions? Review of Educational Research, 87(1), 35–70.

Goldhaber, D., & Hansen, M. (2008). Is it just a bad class? Assessing the stability of measured teacher performance. Working paper 2008-5. Seattle: Center on Reinventing Public Education, University of Washington.

Goldring, E., Grissom, J. A., Rubin, M., Neumerski, C. M., Cannata, M., Drake, T., & Schuermann, P. (2015). Make room value added: Principals’ human capital decisions and the emergence of teacher observation data. Educational Researcher, 44(2), 96–104.

Hanushek, E. A., & Rivkin, S. G. (2010). Generalizations about using value-added measures of teacher quality. American Economic Review, 100(2), 267–271.

Hanushek, E. A., & Woessmann, L. (2008). The role of cognitive skills in economic development. Journal of Economic Literature, 46(3), 607–668. Retrieved from http://hanushek.stanford.edu/sites/default/files/publications/Hanushek%2BWoessmann%202008%20JEL%2046%283%29.pdf

Harris, D. N., & Sass, T. R. (2014). Skills, productivity, and the evaluation of teacher performance. Economics of Education Review, 40, 183–204.

Jacob, B. A., & Lefgren, L. (2008). Can principals identify effective teachers? Evidence on subjective performance evaluation in education. Journal of Labor Economics, 26(1), 101–136.

Jiang, J. Y., Sporte, S. E., & Luppescu, S. (2015). Teacher perspectives on evaluation reform: Chicago’s REACH students. Educational Researcher, 44(2), 105–116.

Koretz, D. (2008). A measured approach: Value-added models are a promising improvement, but no one measure can evaluate teacher performance. American Educator, 32(3), 18–39.

Koretz, D., & Barron, S. (1998). The validity of gains on the Kentucky Instructional Results Information System (KIRIS). Santa Monica, CA: RAND Corporation.

Lazear, E. P. (2003). Teacher incentives. Swedish Economic Policy Review, 10(3), 179–214.

Lockwood, J. R., McCaffrey, D. F., Hamilton, L. S., Stecher, B., Le, V.-N., & Martinez, F. (2006). The sensitivity of value-added teacher effect estimates to different mathematics achievement measures. Santa Monica, CA: RAND Corporation. Retrieved from https://www.rand.org/content/dam/rand/pubs/reports/2009/RAND_RP1269.pdf

Martineau, J. A. (2006). Distorting value-added: The use of longitudinal, vertically scaled student achievement data for growth-based, value-added accountability. Journal of Educational and Behavioral Statistics, 31(1), 35–62.

McCaffrey, D. (2012). Do value-added methods level the playing field for teachers? Carnegie Knowledge Network. Stanford, CA: Carnegie Foundation for the Advancement of Teaching. Retrieved from http://www.carnegieknowledgenetwork.org/wp-content/uploads/2013/06/CKN_2012-10_McCaffrey.pdf

McCaffrey, D. F., & Buzick, H. (2014). Is value-added accurate for teachers of students with disabilities? Carnegie Foundation for the Advancement of Teaching. Retrieved from: http://www.carnegieknowledgenetwork.org/wp-content/uploads/2014/01/CKN_McCaffrey_Disabilities_Fourth_formatted.pdf

McCaffrey, D. F., & Lockwood, J. R. (2011). Missing data in value-added modeling of teacher effects. Annals of Applied Statistics, 5(2A), 773–797. Retrieved from: https://projecteuclid.org/euclid.aoas/1310562205

McCaffrey, D. F., Lockwood, J. R., Koretz, D. M., & Hamilton, L. S. (2003). Evaluating value-added models for teacher accountability. Santa Monica, CA: RAND Corporation. Retrieved from https://www.rand.org/content/dam/rand/pubs/monographs/2004/RAND_MG158.pdf

McLean, R. A., & Sanders, W. L. (1984). Objective component of teacher evaluation: A feasibility study. Working paper No. 199. Knoxville, TN: University of Tennessee, College of Business Administration.

Milanowski, A., Kimball, S., & White, B. (2004). The relationship between standards-based teacher evaluation scores and student achievement: Replication and extensions at three sites. Madison, WI: Consortium for Policy Research in Education, University of Wisconsin.

Murnane, R. J., Willett, J. B., Duhaldeborde, Y., & Tyler, J. H. (2000). How important are the cognitive skills of teenagers in predicting subsequent earnings? Journal of Policy Analysis and Management, 19(4), 547–568.

Rivkin, S. G., Hanushek, E. A., & Kain, J. F. (2005). Teachers, schools, and academic achievement. Econometrica, 73(2), 417–458.

Rothstein, J. (2008). Teacher quality in educational production: Tracking, decay, and student achievement. Working paper No. 14442. Cambridge, MA: National Bureau of Economic Research.

Ruzek, E. A., Domina, T., Conley, A. M., Duncan, G. J., & Karabenick, S. A. (2015). Using value-added models to measure teacher effects on students’ motivation and achievement. Journal of Early Adolescence, 35(5–6), 852–882.

Sanders, W. L., & Horn, S. P. (1998). Research findings from the Tennessee Value-Added Assessment System (TVAAS) database: Implications for educational evaluation and research. Journal of Personnel Evaluation in Education, 12(3), 247–256.

Steele, J. L., Hamilton, L. S., & Stecher, B. M. (2010). Incorporating student performance measures into teacher evaluation systems. Santa Monica, CA: RAND Corporation. Retrieved from https://www.rand.org/pubs/technical_reports/TR917.html

Toch, T., & Rothman, R. (2008). Rush to judgment: Teacher evaluation in public education. Washington, DC: Education Sector.

Weisberg, D., Sexton, S., Mulhern, J., Keeling, D., Schunk, J., Palcisco, A., & Morgan, K. (2009). The widget effect: Our national failure to acknowledge and act on differences in teacher effectiveness. New York, NY: New Teacher Project.

 

Publications

TITLE
SYNOPSIS
CITATION
The Value of Interrupted Time-Series Experiments for Community Intervention Research

This paper advocates the use of time-series experiments for the development and evaluation of community interventions.

Biglan, A., Ary, D., & Wagenaar, A. C. (2000). The value of interrupted time-series experiments for community intervention research. Prevention Science, 1(1), 31–49.

Overview: Formal Teacher Evaluation

The purpose of this overview is to provide information about the role of formal teacher evaluation, the research that examines the practice, and its impact on student outcomes.

Cleaver, S., Detrich, R., & States, J. (2018). Overview of Teacher Formal Evaluation. Oakland, CA: The Wing Institute. https://www.winginstitute.org/teacher-evaluation-formal.

Value-Added Research in Education: Reliability, Validity, Efficacy, and Usefulness

The purpose of this paper on value-added research in education is to define this type of research, provide an overview of how it has been conducted, and discuss its benefits and limitations.

Cleaver, S., Detrich, R., & States, J. (2020). Overview of Value-Added Research in Education: Reliability, Validity, Efficacy, and Usefulness. Oakland, CA: The Wing Institute. https://www.winginstitute.org/staff-value-added.

  


 

Data Mining

TITLE
SYNOPSIS
CITATION
What is the relationship between teacher working conditions and school performance?
This inquiry looks at the effect of time on the job and the quality of a teacher's skills.
Keyworth, R. (2010). What is the relationship between teacher working conditions and school performance? Retrieved from what-is-relationship-between882.
TITLE
SYNOPSIS
CITATION
Teachers and student achievement in the Chicago Public High Schools

The authors estimate the importance of teachers in Chicago public high schools using matched student-teacher administrative data. 

Aaronson, D., Barrow, L., & Sander, W. (2007). Teachers and student achievement in the Chicago public high schools. Journal of Labor Economics, 25(1), 95–135.

Coaching side by side: One-on-one collaboration creates caring, connected teachers

This article describes a school district administrator's research on optimal coaching experiences for classroom teachers. This research was done with the intent of gaining a better understanding of how coaching affects student learning. 

Akhavan, N. (2015). Coaching side by side: One-on-one collaboration creates caring, connected teachers. Journal of Staff Development, 36, 34–37.

 

The effectiveness of a technologically facilitated classroom-based early reading intervention: The targeted reading intervention

The purpose of this study was to evaluate the efficacy of a classroom-teacher-delivered reading intervention for struggling readers called the Targeted Reading Intervention (TRI), designed particularly for kindergarten and first-grade teachers and their struggling students in rural, low-wealth communities. 

Amendum, S. J., Vernon-Feagans, L., & Ginsberg, M. C. (2011). The effectiveness of a technologically facilitated classroom-based early reading intervention: The targeted reading intervention. The Elementary School Journal, 112(1), 107–131.

Teachers Matter: Evidence from Value-Added Assessments.

Value-added assessment proves that very good teaching can boost student learning and that family background does not determine a student's destiny. Students taught by highly effective teachers several years in a row earn higher test scores than students assigned to particularly ineffective teachers.

American Educational Research Association (AERA). (2004). Teachers matter: Evidence from value-added assessments. Research Points, 2(2). Retrieved from http://www.aera.net/Portals/38/docs/Publications/Teachers%20Matter.pdf

ASA statement on using value-added models for educational assessment

Value-added models (VAMs) have been embraced by many states and school districts as part of educational accountability systems. Value-added assessment (VAA) models attempt to estimate the effects of individual teachers or schools on student achievement while accounting for differences in student background. This paper provides a summary of the American Statistical Association's analysis of the efficacy of value-added modeling in education.

American Statistical Association. (2014). ASA statement on using value-added models for educational assessment. Alexandria, VA.

Instructional Coaching: Professional development strategies that improve instruction

This article discusses instructional coaching as well as the eight factors that can increase the likelihood that coaching will be a real fix for a school. Instructional coaching holds much potential for improving the way teachers teach and the way students learn, but that potential will only be realized if leaders plan their coaching program with care. 

Annenberg Institute for School Reform. (2004). Instructional Coaching: Professional development strategies that improve instruction.

Evaluating the impact of performance-related pay for teachers in England.

This paper evaluates the impact of a performance-related pay scheme for teachers in England. 

Atkinson, A., Burgess, S., Croxson, B., Gregg, P., Propper, C., Slater, H., & Wilson, D. (2009). Evaluating the impact of performance-related pay for teachers in England. Labour Economics, 16(3), 251–261.

The need for assessment of maintaining variables in OBM

The authors describe three forms of functional assessment used in applied behavior analysis and explain three potential reasons why OBM has not yet adopted the use of such techniques.

Austin, J., Carr, J. E., & Agnew, J. L. (1999). The need for assessment of maintaining variables in OBM. Journal of Organizational Behavior Management, 19(2), 59–87.

What Do Surveys of Program Completers Tell Us About Teacher Preparation Quality?

This study uses statewide completer survey data from North Carolina to assess whether perceptions of preparation quality and opportunities to learn during teacher preparation predict completers’ value-added estimates, evaluation ratings, and retention.

Bastian, K. C., Sun, M., & Lynn, H. (2018). What do surveys of program completers tell us about teacher preparation quality? Journal of Teacher Education, November 2019.

Questioning the Author: An approach for enhancing student engagement with text

The book presents many examples of Questioning the Author (QtA) in action as children engage with narrative and expository texts to construct meaning.

Beck, I. L., McKeown, M. G., Hamilton, R. L., & Kucan, L. (1997). Questioning the Author: An approach for enhancing student engagement with text. Newark, DE: International Reading Association.

 


Growth, Standards and Accountability

This paper introduces analysis techniques and results showing how student growth percentiles, a normative growth analysis technique, can be used to illuminate the relationship between standards-based accountability systems and the performance standards on which they are based.

Betebenner, D. (2009). Growth, standards and accountability. Dover, NH: National Center for the Improvement of Educational Assessment. Retrieved from https://www.nciea.org/sites/default/files/publications/growthandStandard_DB09.pdf

Effects of coaching on teachers’ use of function-based interventions for students with severe disabilities

This study used a delayed multiple-baseline across-participants design to analyze the effects of coaching on special education teachers’ implementation of function-based interventions with students with severe disabilities. This study also examined the extent to which teachers could generalize function-based interventions in different situations. 

Bethune, K. S., & Wood, C. L. (2013). Effects of coaching on teachers’ use of function-based interventions for students with severe disabilities. Teacher Education and Special Education, 36(2), 97-114.

 

Assessing the value-added effects of literacy collaborative professional development on student learning

This article reports on a 4-year longitudinal study of the effects of Literacy Collaborative (LC), a schoolwide reform model that relies primarily on the one-on-one coaching of teachers as a lever for improving student literacy learning.

Biancarosa, G., Bryk, A. S., & Dexter, E. R. (2010). Assessing the value-added effects of literacy collaborative professional development on student learning. The Elementary School Journal, 111(1), 7–34.

The Value of Interrupted Time-Series Experiments for Community Intervention Research

This paper advocates the use of time-series experiments for the development and evaluation of community interventions.

Biglan, A., Ary, D., & Wagenaar, A. C. (2000). The value of interrupted time-series experiments for community intervention research. Prevention Science, 1(1), 31–49.

Professional development and teacher learning: Mapping the terrain

Teacher professional development is essential to efforts to improve our schools. This article maps the terrain of research on this important topic. It first provides an overview of what we have learned as a field, about effective professional development programs and their impact on teacher learning. 

Borko, H. (2004). Professional development and teacher learning: Mapping the terrain. Educational Researcher, 30(8), 3–15.

Teacher Preparation and Student Achievement.

This article examined the differences in effectiveness of teacher preparation programs that supply teachers to New York City schools.  One of the important findings is that preparation directly linked to practice benefits teachers in their first year.

Boyd, D. J., Grossman, P. L., Lankford, H., Loeb, S., & Wyckoff, J. (2009). Teacher Preparation and Student Achievement. Educational Evaluation & Policy Analysis, 31(4), 416-440.

The narrowing gap in New York City teacher qualifications and its implications for student achievement in high-poverty schools.

By estimating the effect of teacher attributes using a value-added model, the analyses in this paper predict that observable qualifications of teachers resulted in average improved achievement for students in the poorest decile of schools of .03 standard deviations.

Boyd, D., Lankford, H., Loeb, S., Rockoff, J., & Wyckoff, J. (2008). The narrowing gap in New York City teacher qualifications and its implications for student achievement in high-poverty schools. Journal of Policy Analysis and Management: The Journal of the Association for Public Policy Analysis and Management, 27(4), 793–818.

The value in value added depends on the ecology.

These five articles begin to build a bridge between the research literature and practice. Specifically, they report on how the indicators derived from value-added models (VAM) actually play out in practice and give careful consideration to how the design and implementation of teacher evaluation systems could be modified to enhance the positive impact of accountability and mitigate negative consequences.

Braun, H. (2015). The value in value added depends on the ecology. Educational Researcher, 44(2), 127–131. Retrieved from https://doi.org/10.3102%2F0013189X15576341

Using Student Progress to Evaluate Teachers: A Primer on Value-Added Models. Policy Information Perspective.

This report is a lay person’s guide to value-added modeling as a means of evaluating teacher performance.

Braun, H. I. (2005). Using Student Progress to Evaluate Teachers: A Primer on Value-Added Models. Policy Information Perspective. Educational Testing Service. Retrieved from http://files.eric.ed.gov/fulltext/ED529977.pdf

 

The debate about rewards and intrinsic motivation: Protests and accusations do not alter the results.
 

In this paper, the authors show that the questions we asked are fundamental and that our meta-analytic techniques are appropriate, robust, and statistically correct. In sum, the results and conclusions of our meta-analysis are not altered by our critics’ protests and accusations.

Cameron, J., & Pierce, W. D. (1996). The debate about rewards and intrinsic motivation: Protests and accusations do not alter the results. Review of Educational Research, 66(1), 39–51.

Value-added measures: How and why the strategic data project uses them to study teacher effectiveness

This brief explains how and why Strategic Data Project (SDP) uses value-added measures for our diagnostic work. We also explain how value-added measures relate to other measures of teacher effectiveness and the limitations of value-added measures.

Center for Education Policy Research. (2011). Value-added measures: How and why the strategic data project uses them to study teacher effectiveness. Retrieved from https://hwpi.harvard.edu/files/sdp/files/sdp-va-memo_0.pdf

The Long-Term Impacts Of Teachers: Teacher Value-Added And Student Outcomes In Adulthood

This paper examines the issue of efficacy of value-added measures in evaluating teachers. This question is important in understanding whether value-added analysis provides unbiased estimates of teachers’ impact on student achievement and whether these teachers improve long-term student outcomes.

Chetty, R., Friedman, J. N., & Rockoff, J. E. (2011). The long-term impacts of teachers: Teacher value-added and student outcomes in adulthood (No. w17699). National Bureau of Economic Research.


Measuring the Impacts of Teachers II: Teacher Value-Added and Student Outcomes in Adulthood

This paper examines the efficacy of value-added measures in evaluating the effectiveness of teachers and their long-term impact on students' lives.

Chetty, R., Friedman, J. N., & Rockoff, J. E. (in press). Measuring the impacts of teachers II: Teacher value-added and student outcomes in adulthood. American Economic Review.

How does your kindergarten classroom affect your earnings? Evidence from Project STAR.

This paper evaluates the long-term impacts of STAR by linking the experimental data to administrative records.

Chetty, R., Friedman, J., Hilger, N., Saez, E., Schanzenbach, D. W., & Yagan, D. (2011). How does your kindergarten classroom affect your earnings? Evidence from Project STAR. Quarterly Journal of Economics, 126(4), 1593–1660.

Overview: Formal Teacher Evaluation

The purpose of this overview is to provide information about the role of formal teacher evaluation, the research that examines the practice, and its impact on student outcomes.

Cleaver, S., Detrich, R., & States, J. (2018). Overview of Teacher Formal Evaluation. Oakland, CA: The Wing Institute. https://www.winginstitute.org/teacher-evaluation-formal.

Value-Added Research in Education: Reliability, Validity, Efficacy, and Usefulness

The purpose of this paper on value-added research in education is to define this type of research, provide an overview of how it has been conducted, and discuss its benefits and limitations.

Cleaver, S., Detrich, R., & States, J. (2020). Overview of Value-Added Research in Education: Reliability, Validity, Efficacy, and Usefulness. Oakland, CA: The Wing Institute. https://www.winginstitute.org/staff-value-added.

  

Teacher heterogeneity, value-added and education policy.

This study examines the theoretical and practical implications of ranking teachers with a one-dimensional value-added metric when teacher effectiveness varies across subjects or student types.

Condie, S., Lefgren, L., & Sims, D. (2014). Teacher heterogeneity, value-added and education policy. Economics of Education Review, 40, 76–92.

Can teachers be evaluated by their students’ test scores? Should they be? The use of value-added measures for teacher effectiveness in policy and practice

In this report, the author aims to provide an accessible introduction to these new measures of teaching quality and put them into the broader context of concerns over school quality and achievement gaps.

Corcoran, S. P. (2010). Can Teachers Be Evaluated by Their Students' Test Scores? Should They Be? The Use of Value-Added Measures of Teacher Effectiveness in Policy and Practice. Education Policy for Action Series. Annenberg Institute for School Reform at Brown University (NJ1).

Introduction to classical and modern test theory.

This text was written to help the reader acquire a base of knowledge about classical psychometrics and to integrate new ideas into that framework of knowledge.

Crocker, L. M., & Algina, J. (1986). Introduction to classical and modern test theory. New York: Holt, Rinehart and Winston.

Evaluating teacher evaluation: Popular modes of evaluating teachers are fraught with inaccuracies and inconsistencies

Popular modes of evaluating teachers are fraught with inaccuracies and inconsistencies, but the field has identified better approaches. Value-added models enable researchers to use statistical methods to measure changes in student scores over time while considering student characteristics and other factors often found to influence achievement.

Darling-Hammond, L., Amrein-Beardsley, A., Haertel, E., & Rothstein, J. (2012). Evaluating teacher evaluation: Popular modes of evaluating teachers are fraught with inaccuracies and inconsistencies, but the field has identified better approaches. Phi Delta Kappan, 93(6), 8–15. Retrieved from https://www.edweek.org/ew/articles/2012/03/01/kappan_hammond.html

What research says about using value-added measures to evaluate teachers.

A growing number of researchers are studying whether value-added measures can do a good job of measuring the contribution of teachers to test score growth. Here I summarize a handful of analyses that shed light on two questions.

David, J. L. (2010). What research says about using value-added measures to evaluate teachers. Educational Leadership, 67(8), 81–82. Retrieved from http://www.ascd.org/publications/educational_leadership/may10/vol67/num08/Using_Value-Added_Measures_to_Evaluate_Teachers.aspx

Sharing successes and hiding failures: ‘reporting bias’ in learning and teaching research

This paper examines factors that lead to bias and offers specific recommendations to journals, funders, ethics committees, and universities designed to reduce reporting bias.

Dawson, P., & Dawson, S. L. (2018). Sharing successes and hiding failures: 'Reporting bias' in learning and teaching research. Studies in Higher Education, 43(8), 1405–1416.

How important are school principals in the production of student achievement?

As school leaders, principals can influence student achievement in a number of ways, such as: hiring and firing of teachers, monitoring instruction, and maintaining student discipline, among many others. We measure the effect of individual principals on gains in math and reading achievement between grades 4 and 7 using a value-added framework

Dhuey, E., & Smith, J. (2014). How important are school principals in the production of student achievement? Canadian Journal of Economics, 47(2), 634–663.

The Effect of Career and Technical Education on Human Capital Accumulation: Causal Evidence from Massachusetts

Twenty percent of high school students take four or more courses in career and technical education (CTE). Despite this high rate of participation, little is known about what constitutes high-quality CTE and whether high-quality CTE allows participants to accumulate meaningful knowledge and skills to succeed in a career. This study from the Association for Education Finance and Policy examined the impact of participating in CTE on high school attendance, high school completion, professional certifications, and performance on standardized tests. The evidence suggests that a high-quality CTE program boosts on-time graduation for both higher income and lower income students.

 

Dougherty, S. M. (2016). The effect of career and technical education on human capital accumulation: Causal evidence from Massachusetts. Education Finance and Policy. doi:10.1162/EDFP_a_00224.

Selecting growth measures for school and teacher evaluations: Should proportionality matter?

In this paper we take up the question of model choice and examine three competing approaches. The first approach, the student growth percentile (SGP) framework, eschews all controls for student covariates and schooling environments. The second approach, value-added models (VAMs), controls for student background characteristics and under some conditions can be used to identify the causal effects of schools and teachers. The third approach, also VAM-based, fully levels the playing field so that the correlation between school- and teacher-level growth measures and student demographics is essentially zero. We argue that the third approach is the most desirable for use in educational evaluation systems.

Ehlert, M., Koedel, C., Parsons, E., & Podgursky, M. (2013). Selecting growth measures for school and teacher evaluations: Should proportionality matter? National Center for Analysis of Longitudinal Data in Education Research, 21.

Value-added modeling and educational accountability: Are we answering the real questions?

Value-added estimates of teacher or school quality are increasingly used for both high- and low-stakes accountability purposes, making understanding of their limitations critical.

Everson, K. C. (2017). Value-added modeling and educational accountability: Are we answering the real questions? Review of Educational Research, 87(1), 35–70.

Stand by me: What teachers say about unions, merit pay, and other professional matters

This paper examines teachers' views on unions, tenure, pay-for-performance, alternative certification, and other issues and finds that while most teachers are strong supporters of standards, a sense of vulnerability, along with fears of politics and favoritism, makes them loyal to the tenure system, loyal to their unions, and highly skeptical about pay tied to student test scores.

Farkas, S., Johnson, J., & Duffett, A. (2003). Stand by me: What teachers say about unions, merit pay, and other professional matters. New York: Public Agenda.

Coaching middle-level teachers to think aloud improves comprehension instruction and student reading achievement
In an effort to improve student achievement, a group of middle-school teachers at an underperforming school developed a school-wide literacy plan. As part of the plan, they agreed to model their thinking while reading aloud. Eight teachers were selected for coaching related to thinking aloud in which they exposed students to comprehension strategies that they used while reading. 

Fisher, D., Frey, N., & Lapp, D. (2011). Coaching middle-level teachers to think aloud improves comprehension instruction and student reading achievement. The Teacher Educator, 46(3), 231-243.

Evidence for the need to more closely examine school effects in value-added modeling and related accountability policies

This paper evaluates the reasonableness of the assumptions of weighted approaches used to account for building-level effects when value-added modeling is used to evaluate teachers. Urban schools, and ultimately the teachers in those schools, were negatively impacted by using weighted models to account for building-level effects.

Franco, M. S., & Seidel, K. (2014). Evidence for the need to more closely examine school effects in value-added modeling and related accountability policies. Education and Urban Society. Retrieved from http://journals.sagepub.com/doi/abs/10.1177/0013124511432306

Strategies for Effective Classroom Coaching

This article aimed to present frameworks and practices coaches can use with classroom teachers to facilitate the implementation of evidence-based interventions in schools.

Garbacz, S. A., Lannie, A. L., Jeffrey-Pearsall, J. L., & Truckenmiller, A. J. (2015). Strategies for effective classroom coaching. Preventing School Failure: Alternative Education for Children and Youth, 59(4), 263–273.

Is it just a bad class? Assessing the stability of measured teacher performance

This paper reports on work estimating the stability of value-added estimates of teacher effects, an important area of investigation given that new workforce policies implicitly assume that effectiveness is a stable attribute within teachers.

Goldhaber, D. D., & Hansen, M. (2008). Is it just a bad class? Assessing the stability of measured teacher performance. Seattle, WA: Center on Reinventing Public Education.


Teacher career paths, teacher quality, and persistence in the classroom: Are public schools keeping their best?

In this paper we examine the mobility of early-career teachers of varying quality, measured using value-added estimates of teacher performance.

Goldhaber, D., Gross, B., & Player, D. (2011). Teacher career paths, teacher quality, and persistence in the classroom: Are public schools keeping their best? Journal of Policy Analysis and Management, 30(1), 57–87.

Make room value added: Principals’ human capital decisions and the emergence of teacher observation data.

Interview and survey data from six school districts that have recently implemented new evaluation systems with classroom observations provide evidence that principals tend to rely less on test scores in their human capital decisions. 

Goldring, E., Grissom, J. A., Rubin, M., Neumerski, C. M., Cannata, M., Drake, T., & Schuermann, P. (2015). Make room value added: Principals’ human capital decisions and the emergence of teacher observation data. Educational Researcher, 44(2), 96–104.

The legal and policy implications of value-added teacher assessment policies

This paper argues that policies requiring students’ state test scores to account for 40-50% of a teacher’s overall performance score in value-added modeling may result in a significant number of teachers being falsely identified as ineffective. Using such value-added models may leave school districts legally vulnerable to lawsuits.

Green, P. C., Baker, B. D., & Oluwole, J. (2012). The legal and policy implications of value-added teacher assessment policies. BYU Educ. & LJ. Retrieved from http://digitalcommons.law.byu.edu/elj/vol2012/iss1/2

What works in professional development?

A research synthesis confirms the difficulty of translating professional development into student achievement gains despite the intuitive and logical connection. Those responsible for planning and implementing professional development must learn how to critically assess and evaluate the effectiveness of what they do.

Guskey, T. R., & Yoon, K. S. (2009). What works in professional development? Phi Delta Kappan. doi:10.1177/003172170909000709

Reliability and Validity of Inferences about Teachers Based on Student Scores

Policymakers and school administrators have embraced value-added models of teacher effectiveness as tools for educational improvement. Teacher value-added estimates may be viewed as complicated scores. This paper examines the use of value-added modeling as a tool to distinguish effective teachers from ineffective ones.

Haertel, E. H. (2013). Reliability and Validity of Inferences about Teachers Based on Student Scores. William H. Angoff Memorial Lecture Series. Educational Testing Service.

Generalizations about using value-added measures of teacher quality.

The precise method of attributing differences in classroom achievement to teachers is the subject of considerable discussion and analysis.

Hanushek, E. A., & Rivkin, S. G. (2010). Generalizations about using value-added measures of teacher quality. American Economic Review, 100(2), 267-271.

The role of cognitive skills in economic development.

This paper reviews the role of cognitive skills in promoting economic well-being, with a particular focus on the role of school quality and quantity.

Hanushek, E. A., & Woessmann, L. (2008). The role of cognitive skills in economic development. Journal of Economic Literature, 46(3), 607-668.

Value-added measures in education: What every educator needs to know

In" Value-Added Measures in Education", Douglas N. Harris takes on one of the most hotly
debated topics in education. Drawing on his extensive work with schools and districts, he
sets out to help educators and policymakers understand this innovative approach to
assessment and the issues associated with its use.

Harris, D. N. (2011). Value-Added Measures in Education: What Every Educator Needs to Know. Cambridge, MA: Harvard Education Press.

Skills, productivity, and the evaluation of teacher performance.

The authors examine the relationships between observational ratings of teacher performance, principals’ evaluations of teachers’ cognitive and non-cognitive skills and test-score based measures of teachers’ productivity. 

Harris, D. N., & Sass, T. R. (2014). Skills, productivity and the evaluation of teacher performance. Economics of Education Review, 40, 183-204.

A Review of value-added models

This paper examines issues of value-added modeling. It describes what value-added modeling is and how it works in education.

Hibpshman, T. L. (2004). A Review of value-added models. Kentucky Education Professional Standards Board.

Do Principals Fire the Worst Teachers?

This paper examines how principals make decisions regarding teacher dismissal. The study estimates the relative weight that school administrators place on a variety of teacher characteristics and finds evidence that principals do consider teacher absences and value-added measures, along with several demographic characteristics, in determining which teachers to dismiss.

Jacob, B. A. (2010). Do principals fire the worst teachers? (No. w15715). National Bureau of Economic Research.

Can Principals Identify Effective Teachers? Evidence on Subjective Performance Evaluation in Education

This paper examines how well principals can distinguish between more and less effective teachers. To put principal evaluations in context, the authors compare them with the traditional determinants of teacher compensation (education and experience) as well as value-added measures of teacher effectiveness.

Jacob, B. A., & Lefgren, L. (2008). Can principals identify effective teachers? Evidence on subjective performance evaluation in education. Journal of Labor Economics, 26(1), 101-136.

Teacher perspectives on evaluation reform: Chicago’s REACH students.

This study draws on 32 interviews from a random sample of teachers and 2 years of survey data from more than 12,000 teachers per year to measure their perceptions of the clarity, practicality, and cost of the new system.

Jiang, J. Y., Sporte, S. E., & Luppescu, S. (2015). Teacher perspectives on evaluation reform: Chicago’s REACH students. Educational Researcher, 44(2), 105-116.

Student Achievement through Staff Development

This book provides research as well as case studies of successful professional development strategies and practices for educators.

Joyce, B. R., & Showers, B. (2002). Student achievement through staff development. ASCD.

Estimating teacher impacts on student achievement: An experimental evaluation

This study used a random-assignment experiment in Los Angeles Unified School District to evaluate various non-experimental methods for estimating teacher effects on student test scores. Having estimated teacher effects during a pre-experimental period, the authors used these estimates to predict student achievement following random assignment of teachers to classrooms.

Kane, T. J., & Staiger, D. O. (2008). Estimating teacher impacts on student achievement: An experimental evaluation (No. w14607). National Bureau of Economic Research.

Have We Identified Effective Teachers? Validating Measures of Effective Teaching Using Random Assignment.

In this study the authors designed the Measures of Effective Teaching (MET) project to test replicable methods for identifying effective teachers. In past reports, the authors described three approaches to measuring different aspects of teaching: student surveys, classroom observations, and a teacher's track record of student achievement gains on state tests.

Kane, T. J., McCaffrey, D. F., Miller, T., & Staiger, D. O. (2013). Have We Identified Effective Teachers? Validating Measures of Effective Teaching Using Random Assignment. Research Paper. MET Project. Bill & Melinda Gates Foundation.

Identifying effective classroom practices using student achievement data

This paper combines information from classroom-based observations and measures of teachers' ability to improve student achievement as a step toward addressing these challenges. The results point to the promise of teacher evaluation systems that would use information from both classroom observations and student test scores to identify effective teachers.

Kane, T. J., Taylor, E. S., Tyler, J. H., & Wooten, A. L. (2011). Identifying effective classroom practices using student achievement data. Journal of Human Resources, 46(3), 587-613.

Examining teacher evaluation validity and leadership decision making within a standards-based evaluation system

Substantial variation was found in the relationship of evaluators' ratings of teachers and value-added measures of the average achievement of the teachers' students. The results did not yield a simple explanation for the differences in validity of evaluators' ratings. Instead, evaluators' decisions were found to be a complex and idiosyncratic function of motivation, skill, and context.

Kimball, S. M., & Milanowski, A. (2009). Examining teacher evaluation validity and leadership decision making within a standards-based evaluation system. Educational Administration Quarterly, 45(1), 34-70.

Assessing the cost of instructional coaching.

This study presents and applies a framework for measuring the cost of coaching programs in three schools. The study then discusses strategies for reducing the average cost of instructional coaching.

Knight, D. S. (2012). Assessing the cost of instructional coaching. Journal of Education Finance, 52-80.

Value-added modeling: A review

This article provides a review of areas of agreement and disagreement regarding various aspects of value-added modeling.

Koedel, C., Mihaly, K., & Rockoff, J. E. (2015). Value-added modeling: A review. Economics of Education Review. Retrieved from http://faculty.smu.edu/millimet/classes/eco7321/papers/koedel et al 2015.pdf

A measured approach: Value-added models are a promising improvement, but no one measure can evaluate teacher performance

The education policy community is abuzz with interest in value-added modeling as a way to estimate the effectiveness of schools and especially teachers. Value-added models provide useful information, but that information is error-prone and has a number of other important limitations.

Koretz, D. (2008). A measured approach. American Educator, 32(2), 18-39.

The validity of gains on the Kentucky Instructional Results Information System (KIRIS).

This study evaluated the extent to which the large performance gains shown on KIRIS represented real improvements in student learning rather than inflation of scores. 

Koretz, D. M., & Barron, S. I. (1998). The validity of gains on the Kentucky Instructional Results Information System (KIRIS). Santa Monica, CA: RAND.

The effect of teacher coaching on instruction and achievement: A meta-analysis of the causal evidence

This study reviews the empirical literature on teacher coaching and conducts meta-analyses to estimate the mean effect of coaching programs on teachers’ instructional practice and students’ academic achievement.

Kraft, M. A., Blazar, D., & Hogan, D. (2018). The effect of teacher coaching on instruction and achievement: A meta-analysis of the causal evidence. Review of Educational Research, 88(4), 547-588.

Using Coaching to improve the Fidelity of Evidence-Based Practices: A Review of Studies

The authors conducted a comprehensive review of research to identify the impact of coaching on changes in preservice and in-service teachers’ implementation of evidence-based practices.

Kretlow, A. G., & Bartholomew, C. C. (2010). Using coaching to improve the fidelity of evidence-based practices: A review of studies. Teacher Education and Special Education, 33(4), 279-299.

Using in-service and coaching to increase teachers’ accurate use of research-based strategies

This study examined the effects of in-service plus follow-up coaching on first grade teachers’ accurate delivery of three research-based strategies during math instruction.

Kretlow, A. G., Cooke, N. L., & Wood, C. L. (2012). Using in-service and coaching to increase teachers’ accurate use of research-based strategies. Remedial and Special Education, 33(6), 348-361.

Using in-service and coaching to increase kindergarten teachers’ accurate delivery of group instructional units.

This study examined the effects of in-service support plus coaching on kindergarten teachers’ accurate delivery of group instructional units in math.

Kretlow, A. G., Wood, C. L., & Cooke, N. L. (2011). Using in-service and coaching to increase kindergarten teachers’ accurate delivery of group instructional units. The Journal of Special Education, 44(4), 234-246.

What matters for elementary literacy coaching? Guiding principles for instructional improvement and student achievement

The seven guiding principles in this manuscript offer research-based directions for literacy coaching.

L’Allier, S., Elish-Piper, L., & Bean, R. M. (2011). What matters for elementary literacy coaching? Guiding principles for instructional improvement and student achievement. The Reading Teacher, 63, 544-554. doi:10.1598/RT.63.7.2

Teacher Incentives

Three questions are addressed. First, what are the principles behind creating optimal teacher incentives, and how close do the actual structures in Sweden and the US conform to the ideal ones? Second, how much is performance affected by creating incentives for current teachers, and how much by changing the pool of teacher applicants? Third, do teacher preferences align with those of their students and of society in general, and if not, why not? Associated with each of these questions are policy implications that may remedy existing distortions.

Lazear, E. P. (2003). Teacher incentives. Swedish Economic Policy Review, 10(2), 179-214.

The politics and statistics of value-added modeling for accountability of teacher preparation programs

This paper reviews the evaluation of a value-added model for evaluating teacher preparation programs in Texas. The model produced statistically meaningful data, but the results were sensitive to decisions about accountability criteria, the selection of teachers, and the selection of control variables. Different decisions across each of these variables would impact the results of the value-added modeling.

Lincove, J. A., Osborne, C., & Dillon…, A. (2014). The politics and statistics of value-added modeling for accountability of teacher preparation programs. … of Teacher Education. Retrieved from http://journals.sagepub.com/doi/abs/10.1177/0022487113504108

The Characteristics and Experiences of Youth in Special Education. Findings from the National Longitudinal Transition Study 2012. Volume 2: Comparisons across Disability Groups

The United States has committed to improving the lives of students with disabilities for over 40 years. Since the advent of Federal Law PL 94-142 in 1975, which mandated a free and appropriate education for all students regardless of ability, and six reauthorizations of the legislation, the federal government has emphasized the need to prepare students with disabilities for post-secondary education, careers, and independent living. The federal investment in funding special education services exceeds $15 billion annually. It is reasonable to ask, are students with disabilities substantially benefiting from these efforts? The National Longitudinal Transition Study (NLTS) provides the most recent data on youth with disabilities and efforts to address their needs. The study used surveys in 2012 and 2013 of a nationally representative set of nearly 13,000 students, mostly those with an individualized education program (IEP) who were expected to receive special education services. The data reveal that participation in key transition activities by youth with an IEP and their parents has declined, although they are just as likely to have gone to an IEP meeting. The findings from this report suggest that a closer examination of current practices is warranted, with a focus on achieving the outcomes the laws were designed to produce.

Lipscomb, S., Haimson, J., Liu, A. Y., Burghardt, J., Johnson, D. R., & Thurlow, M. (2018). Preparing for Life after High School: The Characteristics and Experiences of Youth in Special Education. Findings from the National Longitudinal Transition Study 2012. Volume 2: Comparisons across Disability Groups. Full Report. NCEE 2017-4018. National Center for Education Evaluation and Regional Assistance.

The sensitivity of value-added teacher effect estimates to different mathematics achievement measures.

Using longitudinal data from a cohort of middle school students from a large school district, we estimate separate “value‐added” teacher effects for two subscales of a mathematics assessment under a variety of statistical models varying in form and degree of control for student background characteristics.

Lockwood, J. R., McCaffrey, D. F., Hamilton, L. S., Stecher, B., Le, V. N., & Martinez, J. F. (2007). The sensitivity of value‐added teacher effect estimates to different mathematics achievement measures. Journal of Educational Measurement, 44(1), 47-67.

Los Angeles teacher ratings

About 11,500 Los Angeles Unified elementary school teachers and 470 elementary schools are included in The Times' updated database of "value-added" ratings.

Los Angeles Times. (2021). Los Angeles teacher ratings.

Education Pays 2016: The Benefits of Higher Education for Individuals and Society.

This report documents differences in the earnings and employment patterns of U.S. adults with different levels of education. It also compares health-related behaviors, reliance on public assistance programs, civic participation, and indicators of the well-being of the next generation. This year's report also presents data on variation in earnings by different characteristics such as gender, race/ethnicity, occupation, college major, and sector. 

Ma, J., Pender, M., & Welch, M. (2016). Education Pays 2016: The Benefits of Higher Education for Individuals and Society. Trends in Higher Education Series. College Board.

Distorting value-added: The use of longitudinal, vertically scaled student achievement data for growth-based, value-added accountability.

This study demonstrates mathematically that the use of “construct-shifting” vertical scales in longitudinal, value-added models introduces remarkable distortions in the value-added estimates of the majority of educators.

Martineau, J. A. (2006). Distorting value-added: The use of longitudinal, vertically scaled student achievement data for growth-based, value-added accountability. Journal of Educational and Behavioral Statistics, 31(1), 35–62.

The effect of content-focused coaching on the quality of classroom text discussions

This study examines the effect of a comprehensive literacy-coaching program focused on enacting a discussion-based approach to reading comprehension instruction (content-focused coaching [CFC]) on the quality of classroom text discussions over 2 years.

Matsumura, L. C., Garnier, H. E., & Spybrook, J. (2012). The effect of content-focused coaching on the quality of classroom text discussions. Journal of Teacher Education, 63, 214-228.

Do value-added methods level the playing field for teachers? Carnegie Knowledge Network

In this brief, we discuss what is and is not known about how well value‐added measures level the playing field for teachers by controlling for student characteristics. 

McCaffrey, D. F. (2012). Do value-added methods level the playing field for teachers. Carnegie Knowledge Network.

Is value-added accurate for teachers of students with disabilities

In this brief, we discuss the challenges of using value-added to evaluate teachers of students with disabilities.

McCaffrey, D. F., & Buzick, H. (2014). Is value-added accurate for teachers of students with disabilities. Carnegie Knowledge Network Brief, (14).

Missing data in value-added modeling of teacher effects

The current study extends recent value-added modeling approaches for longitudinal student achievement data Lockwood et al. [J. Educ. Behav. Statist. 32 (2007) 125–150] to allow data to be missing not at random via random effects selection and pattern mixture models, and applies those methods to data from a large urban school district to estimate effects of elementary school mathematics teachers. 

McCaffrey, D. F., & Lockwood, J. R. (2011). Missing data in value-added modeling of teacher effects. The Annals of Applied Statistics, 773-797.

The promise and peril of using value-added modeling to measure teacher effectiveness

This article addresses the potential sources of bias that can be introduced into value-added modeling by the decisions made about the details of the model. The authors call for a refinement of the procedures used when applying value-added modeling.

McCaffrey, D. F., Koretz, D., Lockwood, J. R., & Hamilton, L. S. (2004). The promise and peril of using value-added modeling to measure teacher effectiveness. RAND Corporation. Retrieved from http://www.rand.org/pubs/research_briefs/RB9050

Evaluating Value-Added Models for Teacher Accountability. Monograph.

Value-added modeling has become of interest to policymakers interested in evaluating teacher performance. The authors argue that the models work well when the schools in the sample are homogeneous, but as the heterogeneity of the student population increases, estimates of teacher effects are likely to be confounded.

McCaffrey, D. F., Lockwood, J. R., Koretz, D. M., & Hamilton, L. S. (2003). Evaluating Value-Added Models for Teacher Accountability. Monograph. ERIC. Retrieved from http://eric.ed.gov/?id=ED529961

Models for value-added modeling of teacher effects

Value-added modeling has become of interest to policymakers interested in evaluating teacher performance. The authors argue that the models work well when the schools in the sample are homogeneous, but as the heterogeneity of the student population increases, estimates of teacher effects are likely to be confounded.

McCaffrey, D. F., Lockwood, J. R., Koretz, D., Louis, T. A., & Hamilton, L. (2004). Models for Value-Added Modeling of Teacher Effects. Journal of Educational and Behavioral Statistics, 29(1), 67-101. doi:10.3102/10769986029001067

Alternative student growth measures for teacher evaluation: Implementation experiences of early-adopting districts

This study examines implementation of alternative student growth measures in a sample of eight school districts that were early adopters of the measures. It builds on an earlier Regional Educational Laboratory Mid-Atlantic report that described the two types of alternative student growth measures (alternative assessment-based value-added models and student learning objectives) in the early-adopting districts.

McCullough, M., English, B., Angus, M. H., & Gill, B. (2015). Alternative student growth measures for teacher evaluation: Implementation experiences of early-adopting districts (No. 8a9dfcb1bc6143608448114ea9b69d06). Mathematica Policy Research.

Early intervention in reading: From research to practice

This study documents the implementation of research-based strategies to minimize the occurrence of reading difficulties in a first-grade population. Three strategies were implemented. 

Menzies, H. M., Mahdavi, J. N., & Lewis, J. L. (2008). Early intervention in reading: From research to practice. Remedial and Special Education, 29(2), 67-77.

Validity research on teacher evaluation systems based on the framework for teaching.

This paper summarizes validity evidence pertaining to several different implementations of the Framework. It is based primarily on reviewing the published and unpublished studies that have looked at the relationship between teacher evaluation ratings made using systems based on the Framework and value-added measures of teacher effectiveness.

Milanowski, A. T. (2011). Validity Research on Teacher Evaluation Systems Based on the Framework for Teaching. Online Submission.

The relationship between standards-based teacher evaluation scores and student achievement: Replication and extensions at three sites

This paper reports on the results of the analysis of an additional year of evaluation and student achievement data at the three research sites.

Milanowski, A. T., Kimball, S. M., & White, B. (2004). The Relationship Between Standards-Based Teacher Evaluation Scores and Student Achievement: Replication and Extensions at Three Sites. Consortium for Policy Research in Education (CPRE), University of Wisconsin Working Paper Series, TC4(01).

How important are the cognitive skills of teenagers in predicting subsequent earnings?

How important are teenagers' cognitive skills in predicting subsequent labor market success? Do cognitive skills pay off in the labor market only for students who go to college? Does college benefit only students who enter with strong basic skills? These questions are often part of current policy debates about how to improve the earnings prospects for young Americans. 

Murnane, R. J., Willett, J. B., Duhaldeborde, Y., & Tyler, J. H. (2000). How important are the cognitive skills of teenagers in predicting subsequent earnings? Journal of Policy Analysis and Management, 19(4), 547-568.

Where is the value in value-added modeling

This paper reviews the various issues and challenges associated with value-added modeling. It concludes with a discussion of how value-added modeling can be used in conjunction with other measures to identify teacher strengths and weaknesses.

Murphy, D. (2012). Where is the value in value-added modeling. Retrieved from http://educatoreffectiveness.pearsonassessments.com/downloads/viva_v1.pdf

Promoting language and literacy development for early childhood educators: A mixed-methods study of coursework and coaching

This study examines the impact of 2 forms of professional development on prekindergarten teachers' early language and literacy practice: coursework and coaching. 

Neuman, S. B., & Wright, T. S. (2010). Promoting language and literacy development for early childhood educators: A mixed-methods study of coursework and coaching. Elementary School Journal, 111, 63-86.

No Child Left Behind Act of 2001

The No Child Left Behind Act of 2001 was a reauthorization of the Elementary and Secondary Education Act (ESEA).

No Child Left Behind Act of 2001, Pub. L. No. 107-110 (2002).

Value-Added Assessment of Teacher Preparation: An Illustration of Emerging Technology

This paper describes one effort to use value-added modeling to evaluate teacher preparation programs in one state.

Noell, G. H., & Burns, J. L. (2006). Value-Added Assessment of Teacher Preparation: An Illustration of Emerging Technology. Journal of Teacher Education, 57(1), 37-50. doi:10.1177/0022487105284466

Effects of an early literacy professional development intervention on Head Start teachers and children

Effects of a 1-semester professional development (PD) intervention that included expert coaching with Head Start teachers were investigated in a randomized controlled trial with 88 teachers and 759 children. 

Powell, D. R., Diamond, K. E., Burchinal, M. R., & Koehler, M. J. (2010). Effects of an early literacy professional development intervention on Head Start teachers and children. Journal of Educational Psychology, 102, 299-312.

Using Coaching to Support Teacher Implementation of Classroom-based Interventions.

This study evaluated the impact of coaching on the implementation of an intervention. Coaching with higher rates of performance feedback resulted in the highest level of treatment integrity.

Reinke, W., Stormont, M., Herman, K., & Newcomer, L. (2014). Using Coaching to Support Teacher Implementation of Classroom-based Interventions. Journal of Behavioral Education, 23(1), 150-167.

Teachers, schools, and academic achievement.

This paper disentangles the impact of schools and teachers in influencing achievement with special attention given to the potential problems of omitted or mismeasured variables and of student and school selection. 

Rivkin, S. G., Hanushek, E. A., & Kain, J. F. (2005). Teachers, schools, and academic achievement. Econometrica, 73(2), 417-458.

How teacher turnover harms student achievement

This study used a version of value-added modeling to evaluate the impact that teacher turnover has on student achievement.

Ronfeldt, M., Lankford, H., Loeb, S., & Wyckoff, J. (2011). How Teacher Turnover Harms Student Achievement. National Bureau of Economic Research Working Paper Series, No. 17176. doi:10.3386/w17176

Teacher quality in educational production: Tracking, decay, and student achievement.

The author develops falsification tests for three widely used VAM specifications, based on the idea that future teachers cannot influence students' past achievement.

Rothstein, J. (2010). Teacher quality in educational production: Tracking, decay, and student achievement. The Quarterly Journal of Economics, 125(1), 175-214.

A randomized controlled trial of COMPASS web-based and face-to-face teacher coaching in autism

Most children with autism rely on schools as their primary source of intervention, yet research has suggested that teachers rarely use evidence-based practices. To address the need for improved educational outcomes, a previously tested consultation intervention called the Collaborative Model for Promoting Competence and Success was evaluated in a 2nd randomized controlled trial, with the addition of a web-based group. 

Ruble, L. A., McGrew, J. H., Toland, M. D., Dalrymple, N. J., & Jung, L. (2013). A randomized controlled trial of COMPASS web-based and face-to-face teacher coaching in autism. Journal of Consulting and Clinical Psychology, 81, 566-572.

The Hidden Cost of California’s Harsh School Discipline: And the Localized Economic Benefits from Suspending Fewer High School Students

This research from the Center for Civil Rights Remedies at the Civil Rights Project, UCLA, and California Dropout Research Project shows that the overuse of suspensions in California schools is harming student achievement and graduation rates, and causing billions of dollars in economic damage. The financial consequences of school suspensions, including both additional costs borne by taxpayers as a result of suspensions and lost economic benefit, are quantified. The impact of school suspension varies widely by school district, with California’s largest districts incurring the greatest losses. For example, suspensions in the Los Angeles Unified School District for a 10th grade cohort are estimated to cause $148 million in economic damage. The report calculates a total statewide economic burden of $2.7 billion over the lifetime of the single 10th grade cohort.

Rumberger, R., & Losen, D. (2017). The Hidden Cost of California’s Harsh School Discipline: And the Localized Economic Benefits from Suspending Fewer High School Students. The Center for Civil Rights Remedies at the Civil Rights Project, UCLA, and California Dropout Research Project.

Using value-added models to measure teacher effects on students’ motivation and achievement

Using data from 35 seventh-grade teachers and 2,026 students across seven schools, we employ VA methods to measure teacher contributions to students’ motivational orientations (mastery and performance achievement goals) and their mathematics performance. 

Ruzek, E. A., Domina, T., Conley, A. M., Duncan, G. J., & Karabenick, S. A. (2015). Using value-added models to measure teacher effects on students’ motivation and achievement. The Journal of Early Adolescence, 35(5-6), 852-882.

Professional development for cognitive reading strategy instruction

In this article, we describe and report on the results of a study in Texas that tested 2 models of professional development for classroom teachers as a way of improving their practices and increasing the reading achievement of their students. 

Sailors, M., & Price, L. (2010). Professional development for cognitive reading strategy instruction. Elementary School Journal, 110, 301-323.

The Tennessee value-added assessment system (TVAAS): Mixed-model methodology in educational assessment

This paper describes the mixed-model methodology Tennessee developed to evaluate teacher contributions to student achievement. It describes how the developers resolved some of the challenges of using value-added modeling.

Sanders, W. L., & Horn, S. P. (1994). The Tennessee value-added assessment system (TVAAS): Mixed-model methodology in educational assessment. Journal of Personnel Evaluation in Education. Retrieved from https://eric.ed.gov/?id=EJ498467

Research Findings from the Tennessee Value-Added Assessment System (TVAAS) Database: Implications for Educational Evaluation and Research

The Tennessee Value-Added Assessment System determines the effectiveness of school systems, schools, and teachers based on student academic growth over time.

Sanders, W. L., & Horn, S. P. (1998). Research findings from the Tennessee Value-Added Assessment System (TVAAS) database: Implications for educational evaluation and research. Journal of Personnel Evaluation in Education, 12(3), 247-256.

Cumulative and residual effects of teachers on future student academic achievement.

The Tennessee Value-Added Assessment System determines the effectiveness of school systems, schools, and teachers based on student academic growth over time. Research conducted utilizing data from the TVAAS database has shown that race, socioeconomic level, class size, and classroom heterogeneity are poor predictors of student academic growth. Rather, the effectiveness of the teacher is the major determinant of student academic progress.

Sanders, W. L., & Rivers, J. C. (1996). Cumulative and residual effects of teachers on future student academic achievement.

Measuring teaching using value-added modeling: The imperfect panacea

This paper attempts to untangle some of the competing claims about the value of value-added modeling. It concludes that value-added modeling should not be used as the sole measure of teacher performance but should be part of a larger accountability system.

Scherrer, J. (2011). Measuring teaching using value-added modeling: The imperfect panacea. NASSP Bulletin. Retrieved from https://eric.ed.gov/?id=EJ938929.

Effects of multilevel support on first-grade teachers’ use of research-based strategies during beginning reading instruction

The purpose of this study was to examine the effects of multilevel support on first-grade teachers' accurate use of research-based strategies during beginning reading instruction and the extent to which teachers maintained use of these strategies. 

Schnorr, C. I. (2013). Effects of multilevel support on first-grade teachers' use of research-based strategies during beginning reading instruction (Doctoral dissertation, The University of North Carolina at Charlotte).

The Impacts of Reading Recovery at Scale: Results From the 4-Year i3 External Evaluation

A recent large-scale evaluation of Reading Recovery, a supplemental reading program for young struggling readers, supports previous research that found it to be effective. In a 4-year, federally funded project involving almost 3,500 students in 685 schools, students generally benefited from the intervention. Students receiving Reading Recovery receive supplemental services in a 1:1 instructional setting for 30 minutes, 5 days a week, from an instructor trained in Reading Recovery. In the study reported here, students who received Reading Recovery had effect sizes of .35-.37 relative to a control group across a number of measures of reading. These represent moderate effect sizes and account for about a 1.5-month increase in skill relative to the control group. Even though the research supports the efficacy of the intervention, it also raises questions about its efficiency. The schools that participated in the study served about 5 students each, and the estimated cost per student has ranged from $2,000-$5,000. These data raise questions about the wisdom of spending this much money per student for growth of about a month and a half.

Sirinides, P., Gray, A., & May, H. (2018). The Impacts of Reading Recovery at Scale: Results From the 4-Year i3 External Evaluation. Educational Evaluation and Policy Analysis, 0162373718764828.

Teacher job satisfaction and motivation to leave the teaching profession: Relations with school context, feeling of belonging, and emotional exhaustion.

This study examines the relations between school context variables and teachers’ feeling of belonging, emotional exhaustion, job satisfaction, and motivation to leave the teaching profession. Six aspects of the school context were measured: value consonance, supervisory support, relations with colleagues, relations with parents, time pressure, and discipline problems.

Skaalvik, E. M., & Skaalvik, S. (2011). Teacher job satisfaction and motivation to leave the teaching profession: Relations with school context, feeling of belonging, and emotional exhaustion. Teaching and Teacher Education, 27(6), 1029-1038.

Incorporating student performance measures into teacher evaluation systems.

The authors examine how the five profiled systems are addressing assessment quality, evaluating teachers in nontested subjects and grades, and assigning teachers responsibility for particular students. The authors also examine what is and is not known about the quality of various student performance measures used by school systems.

Steele, J. L., Hamilton, L. S., & Stecher, B. M. (2010). Incorporating Student Performance Measures into Teacher Evaluation Systems. Technical Report. Rand Corporation.

Evaluating special educator effectiveness: Addressing issues inherent to value-added modeling

This paper addresses the unique challenges posed by special education when using value added modeling to evaluate teacher effectiveness.

Steinbrecher, T. D., Selig, J. P., Cosbey, J., & Thorstensen, B. I. (2014). Evaluating Special Educator Effectiveness. Exceptional Children, 80(3), 323-336. doi:10.1177/0014402914522425

The Effectiveness of Direct Instruction Curricula: A Meta-Analysis of a Half-Century of Research

A soon-to-be-published meta-analysis of Direct Instruction (DI) curricula, reviewing research on DI curricula between 1966 and 2016, reports that DI curricula produced moderate to large effect sizes across the curriculum areas of reading, math, language, and spelling. The review is notable because it covers a much larger body of DI research than past reviews and a wide range of experimental designs (from single subject to randomized trials). In all, 328 studies were reviewed and almost 4,000 effects were considered. Given the variability in research designs and the breadth of the effects considered, the review suggests that DI curricula produce robust results. There was very little decline during maintenance phases of the studies, and greater exposure to the curricula resulted in greater effects.

Stockard, J., Wood, T. W., Coughlin, C., & Rasplica Khoury, C. (2018). The effectiveness of direct instruction curricula: A meta-analysis of a half century of research. Review of Educational Research, 88(4), 479-507.

Multitiered support framework for teachers’ classroom-management practices: Overview and case study of building the triangle for teachers

In this article, the authors describe key features of the multi-tiered support (MTS) continuum of intervention and assessment and present a case study to illustrate implementation of some components of the framework with four middle school teachers.

Sugai, G. (2014). Multitiered support framework for teachers’ classroom-management practices: Overview and case study of building the triangle for teachers. Journal of Positive Behavior Interventions, 16(3), 179-190.

Targeted reading intervention: A coaching model to help classroom teachers with struggling readers

This study examined the effectiveness of a classroom teacher intervention, the Targeted Reading Intervention (TRI), in helping struggling readers in kindergarten and first grade. This intervention used biweekly literacy coaching in the general education classroom to help classroom teachers use diagnostic strategies with struggling readers in one-on-one 15-min sessions.  

Targeted reading intervention: A coaching model to help classroom teachers with struggling readers. Learning Disability Quarterly, 35, 102-114.

The Mirage: Confronting the truth about our quest for teacher development

"The Mirage" describes the widely held perception among education leaders that they already know how to help teachers improve, and that they could achieve their goal of great teaching in far more classrooms if they just applied what they knew more widely.

TNTP. (2015). The Mirage: Confronting the truth about our quest for teacher development. Retrieved from: https://tntp.org/publications/view/the-mirage-confronting-the-truth-about-our-quest-for-teacher-development

Rush to judgment: Teacher evaluation in public education

The authors examine the causes and consequences of the status of teacher evaluation and its implications for the current national debate about performance pay for teachers. The report also examines a number of national, state, and local evaluation systems that offer potential alternatives to current practice.

Toch, T., & Rothman, R. (2008). Rush to Judgment: Teacher Evaluation in Public Education. Education Sector Reports. Education Sector.

The coaching of teachers: Results of five training studies.

In this study, the results of five training studies evaluating the effects of a coaching program for use in Dutch primary and secondary schools are described.

Veenman, S., & Denessen, E. (2001). The coaching of teachers: Results of five training studies. Educational Research and Evaluation, 7(4), 385–417.

The Widget Effect: Our National Failure to Acknowledge and Act on Differences in Teacher Effectiveness.

This report examines the pervasive and longstanding failure to recognize and respond to variations in the effectiveness of teachers. 

Weisberg, D., Sexton, S., Mulhern, J., Keeling, D., Schunck, J., Palcisco, A., & Morgan, K. (2009). The widget effect: Our national failure to acknowledge and act on differences in teacher effectiveness. New Teacher Project.

Making the case for evidence-based policy

U.S. public policy has increasingly been conceived, debated, and evaluated through the lenses of politics and ideology. The fundamental question (will the policy work?) too often gets short shrift or is ignored altogether. A remedy is evidence-based policy: a rigorous approach that draws on careful data collection, experimentation, and both quantitative and qualitative analysis to determine what the problem is, which ways it can be addressed, and the probable impacts of each of these ways.

Wesley, P. W., & Buysse, V. (2006). Making the case for evidence- based policy. In V. Buysse & P. W. Wesley (Eds.), Evidence-based practice in the early childhood field (pp. 117–159). Washington, DC: Zero to Three.

Role of professional development and multi-level coaching in promoting evidence-based practice in education

 Due to the increased need to support teachers' use of evidence-based practices in multi-tiered systems of support such as RTI [Response to Intervention] and PBIS [Positive Behavior Interventions and Support], coaching can extend and strengthen professional development. This paper describes a multi-level approach to coaching and provides implications for practice and research.

Wood, C. L., Goodnight, C. I., Bethune, K. S., Preston, A. I., & Cleaver, S. L. (2016). Role of professional development and multi-level coaching in promoting evidence-based practice in education. Learning Disabilities: A Contemporary Journal, 14, 159-170.

Reviewing the Evidence on How Teacher Professional Development Affects Student Achievement. Issues & Answers.

The purpose of this study is to examine research to answer the question, What is the impact of teacher professional development on student achievement?

Yoon, K. S., Duncan, T., Lee, S. W. Y., Scarloss, B., & Shapley, K. L. (2007). Reviewing the Evidence on How Teacher Professional Development Affects Student Achievement. Issues & Answers. REL 2007-No. 033. Regional Educational Laboratory Southwest (NJ1).

University of Pennsylvania: A Grand Bargain for Education Reform

This website provides resources to help educators better understand what value-added modeling is and how it works.
