The authors estimate the importance of teachers in Chicago public high schools using matched student-teacher administrative data.
Aaronson, D., Barrow, L., & Sander, W. (2007). Teachers and student achievement in the Chicago public high schools. Journal of labor Economics, 25(1), 95-135.
This article describes a school district administrator's research on optimal coaching experiences for classroom teachers. This research was done with the intent of gaining a better understanding of how coaching affects student learning.
Akhavan, N. (2015). Coaching side by side: One-on-one collaboration creates caring, connected
teachers. Journal of Staff Development, 36,34-37.
Two experiments are reported which test the effect of increased three-term contingency trials on students' correct and incorrect math responses. The results warrant further research to test whether or not rates of presentation of three-term contingency trials are predictors of effective instruction.
Albers, A. E., & Greer, R. D. (1991). Is the three-term contingency trial a predictor of effective instruction?. Journal of Behavioral Education, 1(3), 337-354.
This book is an analysis of important conceptual and practical issues that face special education professionals.
Algozzine, J. E., Thurlow, M., & Ysseldyke, J. E. (2000). Critical issues in special education.
This case study explored the use of the Bug-in-Ear (BIE) tool for undergraduate student-teacher supervision in the hands of a novice BIE2 coach, including the ease with which BIE equipment can be set up and operated by a novice coach and naïve users in the classroom.
Almendarez, M. B., Zigmond, N., Hamilton, R., Lemons, C., Lyon, S., McKeown, M., Rock, M. (2012). Pushing the horizons of student teacher supervision: Can a bug-in-ear system be an effective plug-and-play tool for a novice electronic coach to use in student teacher supervision? ProQuest Dissertations and Theses.
The purpose of this study was to evaluate the efficacy of a classroom-teacher-delivered reading intervention for struggling readers called the Targeted Reading Intervention (TRI), designed particularly for kindergarten and first-grade teachers and their struggling students in rural, low-wealth communities.
Amendum, S. J., Vernon-Faegans, L. V., & Ginsberg, M. C. (2011). The effectiveness of a technologically facilitated classroom-based early reading intervention. The Elementary School Journal, 112, 107-131.
The purpose of this study was to evaluate the efficacy of a classroom-teacher-delivered reading intervention for struggling readers called the Targeted Reading Intervention (TRI), designed particularly for kindergarten and first-grade teachers and their struggling students in rural, low-wealth communities.
Amendum, S. J., Vernon-Feagans, L., & Ginsberg, M. C. (2011). The effectiveness of a technologically facilitated classroom-based early reading intervention: The targeted reading intervention. The Elementary School Journal, 112(1), 107-131.
Value-Added Models (VAMs) has been embraced by many states and school districts as part of educational accountability systems. Value-Added Assessment (VAA) Models attempt to estimate effects of individual teachers or schools on student achievement while accounting for differences in student background. This paper provides a summary of the American Statistical Associations analysis of the efficacy of value-added modeling in education.
American Statistical Association. (2014). ASA statement on using value-added models for educational assessment. Alexandria, VA.
Value-Added Models (VAMs) has been embraced by many states and school districts as part of educational accountability systems. Value-Added Assessment (VAA) Models attempt to estimate effects of individual teachers or schools on student achievement while accounting for differences in student background. This paper provides a summary of the American Statistical Associations analysis of the efficacy of value-added modeling in education.
American Statistical Association. (2014). ASA statement on using value-added models for educational assessment. Alexandria, VA.
This article discusses instructional coaching as well as the eight factors that can increase the likelihood that coaching will be a real fix for a school. Instructional coaching holds much potential for improving the way teachers teach and the way students learn, but that potential will only be realized if leaders plan their coaching program with care.
Annenburg Institute for School Reform. (2004). Instructional Coaching: Professional development strategies that improve instruction.
This article discusses instructional coaching as well as the eight factors that can increase the likelihood that coaching will be a real fix for a school. Instructional coaching holds much potential for improving the way teachers teach and the way students learn, but that potential will only be realized if leaders plan their coaching program with care.
Annenburg Institute for School Reform. (2004). Instructional Coaching: Professional development strategies that improve instruction.
A report by TNTP finds 99 percent of teachers are rated good or great, confirming related findings that evaluation systems are not meaningfully differentiating teachers or providing useful feedback. TNTP recommends states use student growth as one measure of teacher effectiveness.
Aragon, S. (2018). Teacher Evaluations: What Is the Issue and Why Does It Matter? Policy Snapshot. Education Commission of the States.
This paper evaluates the impact of a performance-related pay scheme for teachers in England.
Atkinson, A., Burgess, S., Croxson, B., Gregg, P., Propper, C., Slater, H., & Wilson, D. (2009). Evaluating the impact of performance-related pay for teachers in England. Labour Economics, 16(3), 251-261.
There is also little or no evidence for the claim that teachers will be more motivated to improve student learning if teachers are evaluated or monetarily rewarded for student test score gains.
Baker, E. L., Barton, P. E., Darling-Hammond, L., Haertel, E., Ladd, H. F., Linn, R. L., ... & Shepard, L. A. (2010). Problems with the Use of Student Test Scores to Evaluate Teachers. EPI Briefing Paper# 278. Economic Policy Institute.
Bad teachers were the problem; good teachers were the solution. It was a simplified binary, but the idea and the research it drew on had spurred policy changes across the country, including a spate of laws establishing new evaluation systems designed to reward top teachers and help weed out low performers. Behind that effort was the Bill and Melinda Gates Foundation, which backed research and advocacy that ultimately shaped these changes.
Barnum, M. (2018, June 21). The Gates Foundation bet big on teacher evaluation. The report it commissioned explains how those efforts fell short. Chalkbeat.Retrieved from https://www.chalkbeat.org/posts/us/2018/06/21/the-gates-foundation-bet-big-on-teacher-evaluation-the-report-it-commissioned-explains-how-those-efforts-fell-short/
This article focuses on one method of follow-up: performance feedback
Barton, E. E., Kinder, K., Casey, A. M., & Artman, K. M. (2011). Finding your feedback fit: Strategies for designing and delivering performance feedback systems. Young Exceptional Children, 14(1), 29–46. doi: 10.1177/1096250610395459
The book presents many examples of Questioning the Author (QtA) in action as children engage with narrative and expository texts to construct meaning.
Beck, I. L., & McKeown, M. G., Hamilton, R. L., & Kugan, L. (1997). Questioning the Author: An approach for enhancing student engagement with text.Newark, DE: International Reading Association.
The book presents many examples of Questioning the Author (QtA) in action as children engage with narrative and expository texts to construct meaning.
Beck, I. L., & McKeown, M. G., Hamilton, R. L., & Kugan, L. (1997). Questioning the Author: An approach for enhancing student engagement with text.Newark, DE: International Reading Association.
This paper introduces analysis techniques and results showing how student growth percentiles, a normative growth analysis technique, can be used to examine the illuminate the relationship between standards based accountability systems and the performance standards on which they are based.
Betebenner, D. W. (2009). Growth, Standards and Accountability.
This study used a delayed multiple-baseline across-participants design to analyze the effects of coaching on special education teachers’ implementation of function-based interventions with students with severe disabilities. This study also examined the extent to which teachers could generalize function-based interventions in different situations.
Bethune, K. S., & Wood, C. L. (2013). Effects of coaching on teachers’ use of function-based interventions for students with severe disabilities. Teacher Education and Special Education, 36(2), 97-114.
In fall 2009, the Bill & Melinda Gates Foundation launched the Measures of Effective Teaching (MET) project to test new approaches to measuring effective teaching. The goal of the MET project is to improve the quality of information about teaching effectiveness available to education professionals within states and districts.
Bill and Melinda Gates Foundation. (2010). Learning about teaching: Initial findings from the measures of effective teaching project.Retrieved from https://docs.gatesfoundation.org/documents/preliminary-findings-research-paper.pdf
Over the objection of the teachers' union, the Board of Education here on Thursday unanimously approved the nation's largest merit pay program, which calls for rewarding teachers based on how well their students perform on standardizes tests.
Blumenthal, R. (2006). Houston ties teachers’ pay to test scores. New York Times, 13.
Teacher professional development is essential to efforts to improve our schools. This article maps the terrain of research on this important topic. It first provides an overview of what we have learned as a field, about effective professional development programs and their impact on teacher learning.
Borko, H. (2004). Professional development and teacher learning: Mapping the terrain. Educational Researcher, 30(8), 3–15.
These five articles begin to build a bridge between literature. specifically, they report on how the use of the indicators derived from value-added models (VAM) actually payout in practice and give carefully consideration to how the design and implementation of teacher evaluation system could be modified to enhance the positive impact of accountability and mitigate the negative consequences,
Braun, H. (2015). The value in value added depends on the ecology. Educational Researcher, 44(2), 127–131. Retrieved from https://doi.org/10.3102%2F0013189X15576341
This book divides itself naturally into two parts. The first part has to do with the situation in which Superintendent Brooks found himself, with his successful campaign in educating his teachers to use standardized tests, with the results which he obtained, with the way he used these results to grade his pupils, to rate his teachers, and to evaluate methods of teaching, and finally with the use he made of intelligence tests.
Brooks, S. S. (1905). Improving schools by standardized tests. Houghton Mifflin.
This paper examines how to measure teacher performance and the practices necessary for increasing teacher trust in systems designed to effectively measure performance.
Cantrell, S., & Scantlebury, J. (2011). Effective Teaching: What Is It and How Is It Measured?. Effective Teaching as a Civil Right, 28.
This book provide detailed information on how to systematically and explicitly teach essential reading skills. The procedures describe in this text have been shown to benefit all student, especially powerful with the most vulnerable learners, children who are at risk because of poverty, disability, or limited knowledge of English.
Carnine, D., Silbert, J., Kameenui, E. J., & Tarver, S. G. (1997). Direct instruction reading. Columbus, OH: Merrill.
This paper presents the results from a field experiment that examines the effects of nonfinancial performance feedback on the behavior of professionals working for an insurance repair company.
Casas‐Arce, P. A. B. L. O., Lourenço, S. M., & MARTÍNEZ‐JEREZ, F. A. (2017). The performance effect of feedback frequency and detail: Evidence from a field experiment in customer satisfaction. Journal of Accounting Research, 55(5), 1051-1088.
This paper discusses the search for a “magic metric” in education: an index/number that would be generally accepted as the most efficient descriptor of school’s performance in a district.
Celio, M. B. (2013). Seeking the Magic Metric: Using Evidence to Identify and Track School System Quality. In Performance Feedback: Using Data to Improve Educator Performance (Vol. 3, pp. 97-118). Oakland, CA: The Wing Institute.
In a time when public schools continue to be scrutinized, school leadership never mattered more in order to exercise school reform. This qualitative study examined how five principals working in an urban school district perceived their evaluation and how it contributed to their practice.
Chacon-Robles, B. (2018). Improving instructional leadership: A multi-case study perspectives on formal evaluations. The University of Texas at El Paso.
This report presents case studies of the efforts by three school districts, Hillsborough County Public Schools (HCPS), Memphis City Schools (MCS), and Pittsburgh Public Schools (PPS), to launch, implement, and operate new teacher evaluation systems as part of a larger reform effort called the Partnership Sites to Empower Effective Teaching.
Chambers, J., Brodziak de los Reyes, I., & O'Neil, C. (2013). How Much are Districts Spending to Implement Teacher Evaluation Systems?.
This paper examines the issue of efficacy of value-added measures in evaluating teachers. This question is important in understanding whether value-added analysis provides unbiased estimates of teachers’ impact on student achievement and whether these teachers improve long-term student outcomes.
Chetty, R., Friedman, J. N., & Rockoff, J. E. (2011). The long-term impacts of teachers: Teacher value-added and student outcomes in adulthood (No. w17699). National Bureau of Economic Research.
This paper evaluates the long-term impacts of STAR by linking the experimental data to administrative records.
Chetty, R., Friedman, J., Hilger, N., Saez, E., Schanzenbach, D. W., & Yagan, D. (2011). How does your kindergarten classroom affect your earnings? Evidence from Project STAR. Quarterly Journal of Economics, 126(4), 1593–1660.
This overview provides information about teacher evaluation as it relates to collecting information about teacher practice and using it to improve student outcomes. The history of teacher evaluation and current research findings and implications are included.
Cleaver, S., Detrich, R. & States, J. (2018). Overview of Teacher Evaluation. Oakland, CA: The Wing Institute. https://www.winginstitute.org/quality-teachers-evaluation.
The purpose of this overview is to provide information about teacher coaching as it is used in schools, the research that examines this practice as a method of teacher professional development, and its impact on student outcomes.
Cleaver, S., Detrich, R. & States, J. (2018). Overview of Teacher Evaluation. Oakland, CA: The Wing Institute. https://www.winginstitute.org/teacher-evaluation-teacher-coaching.
The purpose of this overview is to provide information about the role of formal teacher evaluation, the research that examines the practice, and its impact on student outcomes.
Cleaver, S., Detrich, R. & States, J. (2018). Overview of Teacher Formal Evaluation. Oakland, CA: The Wing Institute.https://www.winginstitute.org/teacher-evaluation-formal.
The purpose of this overview is to provide information about the role of formal teacher evaluation, the research that examines the practice, and its impact on student outcomes.
Cleaver, S., Detrich, R. & States, J. (2018). Overview of Teacher Formal Evaluation. Oakland, CA: The Wing Institute.https://www.winginstitute.org/teacher-evaluation-formal.
This overview examines the current understanding of research on performance feedback as a way to improve teacher performance and student outcomes.
Cleaver, S., Detrich, R. & States, J. (2019). Overview of Performance Feedback. Oakland, CA: The Wing Institute. https://www.winginstitute.org/teacher-evaluation-feedback.
The purpose of this paper on value-added research in education is to define this type of research, provide an overview of how it has been conducted, and discuss its benefits and limitations.
Cleaver, S., Detrich, R. & States, J. (2020). Overview of Value-Added Research in Education: Reliability, Validity, Efficacy, and Usefulness. Oakland, CA: The Wing Institute. https://www.winginstitute.org/staff-value-added.
The purpose of this overview is to provide information about informal evaluation as a practice in schools, and the current understanding of research related to informal teacher evaluation to improve teacher performance and student outcomes.
Cleaver, S., Detrich, R., & States, J. (2019). Informal Teacher Evaluation. Oakland, CA: The Wing Institute. Retrieved from https://www.winginstitute.org/staff-informal.
The purpose of this overview is to provide an understanding of the research base on professional development and its impact on student achievement, as well as offer recommendations for future teacher professional development
Cleaver, S., Detrich, R., States, J. & Keyworth, R. (2020). Overview of Teacher Evaluation. Oakland, CA: The Wing Institute. quality-teachers-in-service.
This study examines the theoretical and practical implications of ranking teachers with a one-dimensional value-added metric when teacher effectiveness varies across subjects or student types.
Condie, S., Lefgren, L., & Sims, D. (2014). Teacher heterogeneity, value-added and education policy. Economics of Education Review, 40, 76-92.
This book is a comprehensive description of the principles and procedures for systematic change of socially significant behavior. It includes basic principles, applications, and behavioral research methods.
Cooper, J. O., Heron, T. E., & Heward, W. L. (2007). Applied behavior analysis.
This book is a comprehensive description of the principles and procedures for systematic change of socially significant behavior. It includes basic principles, applications, and behavioral research methods.
Cooper, J. O., Heron, T. E., & Heward, W. L. (2007). Applied behavior analysis.
In this report, the author aim to provide an accessible introduction to these new measures of teaching quality and put them into the broader context of concerns over school quality and achievement gaps.
Corcoran, S. P. (2010). Can Teachers Be Evaluated by Their Students' Test Scores? Should They Be? The Use of Value-Added Measures of Teacher Effectiveness in Policy and Practice. Education Policy for Action Series. Annenberg Institute for School Reform at Brown University (NJ1).
This text was written to help the reader acquire a base of knowledge about classical psychometrics and to integrate new ideas into that framework of knowledge.
Crocker, L. M., & Algina, J. (1986). Introduction to classical and modern test theory. New York: Holt, Rinehart and Winston.
The framework for teaching is a research-based set of components of instruction that are grounded in a constructivist view of learning and teaching. The framework defines four levels of performance--Unsatisfactory, Basic, Proficient, and Distinguished--for each element, providing a valuable tool that all teachers can use.
Danielson, C. (2007). Enhancing professional practice: A framework for teaching. ASCD.
This article addresses the topics of staff assessment, teacher supervision, and professional development.
Danielson, C. (2011). Evaluations that help teachers learn. Educational leadership, 68(4), 35-39.
This article addresses the topics of staff assessment, teacher supervision, and professional development.
Danielson, C. (2011). Evaluations that help teachers learn. Educational leadership, 68(4), 35-39.
The Framework for Teaching identifies those aspects of a teacher's responsibilities that have
been documented through empirical studies and theoretical research as promoting
improved student learning.
Danielson, C. (2013). The framework for teaching evaluation instrument, 2013 instructionally focused edition. Retrieved from: https://bibliotecadigital.mineduc.cl/bitstream/handle/20.500.12365/17302/2013_FfTEvalInstrument_Web_v1.2_20140825_.pdf
In this article the authors examine some design principles to guide policy-makers and school reformers who seek to promote learner-centred professional development which involves teachers as active and reflective participants in the change process.
Darling-Hammond, L., & McLaughlin, M. W. (1995). Policies that support professional development in an era of reform. Phi Delta Kappan, 76(8), 597–604.
Darling-Hammond, L., Amrein-Beardsley, A., Haertel, E., & Rothstein, J. (2012). Evaluating teacher evaluation: Popular modes of evaluating teachers are fraught with inaccuracies and inconsistencies, but the field has identified better approaches. Phi Delta Kappan, 93(6), 8–15.Retrieved from https://www.edweek.org/ew/articles/2012/03/01/kappan_hammond.html
This article presents a conceptual framework for examining the design and implementation of teacher evaluation processes in school organizations.
Darling-Hammond, L., Wise, A. E., & Pease, S. R. (1983). Teacher evaluation in the organizational context: A review of the literature. Review of educational research, 53(3), 285-328.
A growing number of researchers are studying whether value-added measures can do a good job of measuring the contribution of teachers to test score growth. Here I summarize a handful of analyses that shed light on two questions.
David, J. L. (2010). What research says about using value-added measures to evaluate teachers. Educational Leadership, 67(8), 81–82. Retrieved from http://www.ascd.org/publications/educational_leadership/may10/vol67/num08/Using_Value-Added_Measures_to_Evaluate_Teachers.aspx
A growing number of researchers are studying whether value-added measures can do a good job of measuring the contribution of teachers to test score growth. Here I summarize a handful of analyses that shed light on two questions.
David, J. L. (2010). What research says about using value-added measures to evaluate teachers. Educational Leadership, 67(8), 81–82. Retrieved from http://www.ascd.org/publications/educational_leadership/may10/vol67/num08/Using_Value-Added_Measures_to_Evaluate_Teachers.aspx
This study uses data collected in the intervention classrooms of Head Start REDI (Research- based, Developmentally Informed), a randomized clinical trial testing the efficacy of a comprehensive preschool curriculum targeting children’s social-emotional competence, language, and emergent literacy skills delivered by teachers who received weekly coaching support.
Domitrovich, C. E., Gest, S. D., Jones, D., Gill, S., & DeRousie, R. M. S. (2010). Implementation quality: Lessons learned in the context of the Head Start REDI trial. Early Childhood Research Quarterly, 25(3), 284-298.
The Three-Minute Classroom Walk-Through offers a practical, time-saving alternative that impacts student achievement by cultivating self-reliant teachers who are continuously improving their practice.
Downey, C. J., Steffy, B. E., English, F. W., Frase, L. E., & Poston, W. K. (2004). The three-minute classroom walkthrough: Changing school supervisory practice one teacher at a time. Thousand Oaks, CA: Corwin Press.
Principals are in a paradoxical position. On one hand, they're called on to use research-based strategies to improve student achievement. On the other, they're increasingly required to micromanage teachers by observing in classrooms and engaging in intensive evaluation. The authors point out that these two positions are at odds with each other.
Dufour, R., & Mattos, M. (2013). How Do Principals Really Improve Schools?. Educational Leadership, 70(7), 34-40.
This monograph presents a synthesis of the literature on empirically supported effective teaching principles that have been derived from research on behavioral, cognitive, social-learning, and other theories.
Ellis, E. S., Worthington, L. A., & Larkin, M. J. (1994). research synthesis on effective teaching principles and the design of quality tools for educators.(Tech. Rep. No. 6). Eugene, OR: University of Oregon, National Center to Improve the Tools of Educators.
Value-added estimates of teacher or school quality are increasingly used for both high- and low-stakes accountability purposes, making understanding of their limitations critical.
Everson, K. C. (2017). Value-added modeling and educational accountability: Are we answering the real questions?. Review of Educational Research, 87(1), 35-70.
This paper exams teachers' views on unions, tenure, pay-for-performance, alternative certification, and other issues and finds that while most teachers are strong supporters of standards, a sense of vulnerability, along with fears of politics and favoritism, make them loyal to the tenure system, loyal to their unions, and highly skeptical about pay tied to student test scores.
Farkas, S., Johnson, J., & Duffett, A. (2003). Stand by me: What teachers say about unions,
merit pay, and other professional matters. New York: Public Agenda.
The Every Student Succeeds Act (ESSA) returns decision making for our nation’s education back where it belongs – in the hands of local educators, families, and communities – while keeping the focus on students most in need.
Fennell, M. (2016). What educators need to know about ESSA. Educational Leadership, 73, 62–65.
This paper describes a continuous learning model of principal and superintendent support. The model places a premium on engagement at all levels of the system on shaping a focused culture of instruction within their schools.
Fink, E., & Resnick, L. B. (2001). Developing principals as instructional leaders. Phi Delta Kappan, 82(8), 598-610.
This book shows how principals and other school leaders can develop the skills necessary for teachers to deliver high quality instruction by introducing principals to a five-part model of effective instruction.
Fink, S., & Markholt, A. (2011). Leading for instructional improvement: How successful leaders develop teaching and learning expertise. John Wiley & Sons.
Fisher, D., Frey, N., & Lapp, D. (2011). Coaching middle-level teachers to think aloud improves comprehension instruction and student reading achievement. The Teacher Educator, 46(3), 231-243.
This article describes a school-based randomized trial in over 200 New York City public schools designed to better understand the impact of teacher incentives.
Fryer, R. G. (2013). Teacher incentives and student achievement: Evidence from New York City public schools. Journal of Labor Economics, 31(2), 373-407.
This article aimed to present frameworks and practices coaches can use with classroom teachers to facilitate the implementation of evidence-based interventions in schools.
Garbacz, S. A., Lannie, A. L., Jeffrey-Pearsall, J. L., & Truckenmiller, A. J. (2015). Strategies for effective classroom coaching. Preventing School Failure: Alternative Education for Children and Youth, 59(4), 263-273.
This research synthesis examines how teacher effectiveness is currently measured (i.e., formative vs. summative evaluation).
Goe, L., Bell, C., & Little, O. (2008). Approaches to Evaluating Teacher Effectiveness: A Research Synthesis. National Comprehensive Center for Teacher Quality.
This guide is a tool designed to assist states and districts in constructing high-quality teacher evaluation systems in an effort to improve teaching and learning.
Goe, L., Holdheide, L., & Miller, T. (2011). A Practical Guide to Designing Comprehensive Teacher Evaluation Systems: A Tool to Assist in the Development of Teacher Evaluation Systems. National Comprehensive Center for Teacher Quality.
This paper report on work estimating the stability of value-added estimates of teacher effects, an important area of investigation given that new workforce policies implicitly assume that effectiveness is a stable attribute within teachers.
Goldhaber, D. D., & Hansen, M. (2008). Is it Just a Bad Class?: Assessing the Stability of Measured Teacher Performance. Seattle, WA: Center on Reinventing Public Education.
This paper report on work estimating the stability of value-added estimates of teacher effects, an important area of investigation given that new workforce policies implicitly assume that effectiveness is a stable attribute within teachers.
Goldhaber, D. D., & Hansen, M. (2008). Is it Just a Bad Class?: Assessing the Stability of Measured Teacher Performance. Seattle, WA: Center on Reinventing Public Education.
Interview and survey data from six school districts that have recently implemented new evaluation systems with classroom observations provide evidence that principals tend to rely less on test scores in their human capital decisions.
Goldring, E., Grissom, J. A., Rubin, M., Neumerski, C. M., Cannata, M., Drake, T., & Schuermann, P. (2015). Make room value added: Principals’ human capital decisions and the emergence of teacher observation data. Educational Researcher, 44(2), 96-104.
This paper provide some recommendations to increase the pool of potential teachers, make it tougher to award tenure to those who perform least well, and reward effective teachers who are willing to work in schools serving large numbers of low-income, disadvantaged children.
Gordon, R., Kane, T. J., & Staiger, D. O. (2006). Identifying Effective Teachers Using Performance on the Job. The Hamilton Project Policy Brief No. 2006-01. Brookings Institution.
This paper describe a few promising assessment technologies tat allow us to capture more direct, repeated, and contextually based measures of student learning, and propose an improvement-oriented approach to teaching and learning.
Greenwood, C. R., & Maheady, L. (1997). Measurable change in student performance: Forgotten standard in teacher preparation?. Teacher Education and Special Education, 20(3), 265-275.
Is dismissing poorly performing teachers truly feasible in America today? After all the political capital (and real capital) spent on reforming teacher evaluation, can districts actually terminate ineffective teachers who have tenure or have achieved veteran status?
Griffith, D., & McDougald, V. (2016). Undue process: Why bad teachers in twenty-five diverse districts rarely get fired. Washington DC: Thomas B. Fordham Institute. Retrieved from http://edex. s3-us-west-2. amazonaws. com/publication/pdfs, 2812, 29.
This overview describes strategies for how school personnel can respond when disruptive behavior occurs, including (1) negative consequences that can be applied as primary interventions, (2) functional behavior assessment, and (3) function-based, individualized interventions characteristic of the secondary or tertiary tiers of a multitiered system of support.
Guinness, K., Detrich, R., Keyworth, R. & States, J. (2020). Overview of Decreasing Inppropriate Behavior. Oakland, CA: The Wing Institute. https://www.winginstitute.org/classroom-inappropriate-behaviors.
A research synthesis confirms the difficulty of translating professional development into student achievement gains despite the intuitive and logical connection. Those responsible for planning and implementing professional development must learn how to critically assess and evaluate the effectiveness of what they do.
Guskey, T. R., & Yoon, K. S.(2009). What works in professional development? Phi Delta Kappan.doi: 10.1177003172170909000709.
The major objective of this data analysis was to estimate the relationship between variables which can be controlled by public policy and educational output.
Hanushek, E. A. (1971). Teacher characteristics and gains in student achievement: Estimation using micro data. American Economic Review, 61(2), 280-288.
This discussion provides a quantitative statement of one approach to achieving the governors’ (and the nation’s) goals – teacher deselection.
Hanushek, E. A. (2009). Teacher deselection. Creating a new teaching profession, 168, 172-173.
The precise method of attributing differences in classroom achievement to teachers is the
subject of considerable discussion and analysis.
Hanushek, E. A., & Rivkin, S. G. (2010). Generalizations about using value-added measures of teacher quality. American Economic Review, 100(2), 267-71.
This paper reviews the role of cognitive skills in promoting economic well-being, with a particular focus on the role of school quality and quantity.
Hanushek, E. A., & Woessmann, L. (2008). The role of cognitive skills in economic development. Journal of economic literature, 46(3), 607-68.
The authors examine the relationships between observational ratings of teacher performance, principals’ evaluations of teachers’ cognitive and non-cognitive skills and test-score based measures of teachers’ productivity.
Harris, D. N., & Sass, T. R. (2014). Skills, productivity and the evaluation of teacher performance. Economics of Education Review, 40, 183-204.
This paper reports on the analysis of state statutes and department of education regulations in fifty states for changes in teacher evaluation in use since the passage of No Child Left Behind Act of 2001.
Hazi, H. M., & Rucinski, D. A. (2009). Teacher evaluation as a policy target for improved student learning: A fifty-state review of statute and regulatory action since NCLB. education policy analysis archives, 17, 5.
This paper examined the effects of a professional development intervention that included data-based performance feedback delivered via electronic mail (e-mail) on preschool teachers’ use of descriptive praise and whether increased use of descriptive praise was associated with changes in classroom-wide measures of child engagement and challenging behavior.
Hemmeter, M. L., Snyder, P., Kinder, K., & Artman, K. (2011). Impact of performance feedback delivered via electronic mail on preschool teachers’ use of descriptive praise. Early Childhood Research Quarterly, 26(1), 96-109.
This article discusses the current focus on using teacher observation instruments as part of new teacher evaluation systems being considered and implemented by states and districts.
Hill, H., & Grossman, P. (2013). Learning from teacher observations: Challenges and opportunities posed by new teacher evaluation systems. Harvard Educational Review, 83(2), 371-384.
NSDC opens the door to professional learning that ensures great teaching for every student every day
Hirsh, S. (2009). A new definition. Journal of Staff Development, 30(4), 10–16.
Teacher merit pay has garnered significant attention as a promising reform method for improving teacher performance and, more importantly, student achievement scores. This meta-analysis, which examined findings from 44 studies of teacher merit pay, found that merit pay is associated with a modest, statistically significant, positive effect on student test scores. The research also found that not all merit pay programs are equal. The best results are dependent on constructing efforts that incorporate sound, evidence-based practice elements.
https://journals.sagepub.com/doi/abs/10.3102/0002831220905580
This paper examines how well principals can distinguish between more and less effective teachers. To put principal evaluations in context, we compare them with the traditional determinants of teacher compensation-education and experience-as well as value-added measures of teacher effectiveness.
Jacob, B. A., & Lefgren, L. (2008). Can principals identify effective teachers? Evidence on subjective performance evaluation in education. Journal of Labor Economics, 26(1), 101-136.
This study draws on 32 interviews from a random sample of teachers and 2 years of survey data from more than 12,000 teachers per year to measure their perceptions of the clarity, practicality, and cost of the new system.
Jiang, J. Y., Sporte, S. E., & Luppescu, S. (2015). Teacher perspectives on evaluation reform: Chicago’s REACH students. Educational Researcher, 44(2), 105-116.
The 1988 NAEP surveyed American students' knowledge of reading, writing, civics, U.S. history, and geography.
Johnson, E. G., & Zwick, R. (1990). Focusing the new design: The NAEP 1988 technical report. Journal of Educational and Behavioral Studies, 17,95–109.
This book provides research as well as case studies of successful professional development strategies and practices for educators.
Joyce, B. R., & Showers, B. (2002). Student achievement through staff development. ASCD.
This study investigated the effects of training preschool teachers to use environmental arrangement and milieu teaching in interactions with children using augmented communication systems. Three teachers were taught seven environmental strategies and four milieu teaching procedures through written materials, lecture, modeling, role-playing, and feedback.
Kaiser, A. P., Ostrosky, M. M., & Alpert, C. L. (1993). Training teachers to use environmental arrangement and milieu teaching with nonvocal preschool children. Journal of the Association for Persons with Severe Handicaps, 18(3), 188-199.
The authors examined the effectiveness of self-monitoring for increasing the rates of teacher praise statements and the acceptability of using this technique for teachers. This study's results support the use of self-monitoring to increase effective teaching practices, namely praise, and further demonstrates high social validity for the participant and the students.
Kalis, T. M., Vannest, K. J., & Parker, R. (2007). Praise counts: Using self-monitoring to increase effective teaching practices. Preventing School Failure: Alternative Education for Children and Youth, 51(3), 20-27.
This study used a random-assignment experiment in Los Angeles Unified School District to evaluate various non-experimental methods for estimating teacher effects on student test scores. Having estimated teacher effects during a pre-experimental period, the authors used these estimates to predict student achievement following random assignment of teachers to classrooms.
Kane, T. J., & Staiger, D. O. (2008). Estimating teacher impacts on student achievement: An experimental evaluation (No. w14607). National Bureau of Economic Research.
This report presents an in-depth discussion of the analytical methods and findings from the Measures of Effective Teaching (MET) project’s analysis of classroom observations.1 A nontechnical companion report describes implications for policymakers and practitioners.
Kane, T. J., & Staiger, D. O. (2012). Gathering Feedback for Teaching: Combining High-Quality Observations with Student Surveys and Achievement Gains. Research Paper. MET Project. Bill & Melinda Gates Foundation.
This paper combines information from classroom-based observations and measures of teachers' ability to improve student achievement as a step toward addressing these challenges. The results point to the promise of teacher evaluation systems that would use information from both classroom observations and student test scores to identify effective teachers.
Kane, T. J., Taylor, E. S., Tyler, J. H., & Wooten, A. L. (2011). Identifying effective classroom practices using student achievement data. Journal of human Resources, 46(3), 587-613.
This paper combines information from classroom-based observations and measures of teachers' ability to improve student achievement as a step toward addressing these challenges. The results point to the promise of teacher evaluation systems that would use information from both classroom observations and student test scores to identify effective teachers.
Kane, T. J., Taylor, E. S., Tyler, J. H., & Wooten, A. L. (2011). Identifying effective classroom practices using student achievement data. Journal of human Resources, 46(3), 587-613.
This paper combines information from classroom-based observations and measures of teachers' ability to improve student achievement as a step toward addressing these challenges. The results point to the promise of teacher evaluation systems that would use information from both classroom observations and student test scores to identify effective teachers.
Kane, T. J., Taylor, E. S., Tyler, J. H., & Wooten, A. L. (2011). Identifying effective classroom practices using student achievement data. Journal of human Resources, 46(3), 587-613.
The authors proposed a preliminary FI theory (FIT) and tested it with moderator analyses. The central assumption of FIT is that FIs change the locus of attention among 3 general and hierarchically organized levels of control: task learning, task motivation, and meta-tasks (including self-related) processes.
Kluger, A. N., & DeNisi, A. (1996). The effects of feedback interventions on performance: A historical review, a meta-analysis, and a preliminary feedback intervention theory. Psychological bulletin, 119(2), 254.
this study presents and apply a framework for measuring the cost of coaching programs to 3 schools. Then the study discusses strategies for reducing the average cost of instructional coaching.
Knight, D. S. (2012). Assessing the cost of instructional coaching. Journal of Education Finance, 52-80.
The education policy community is abuzz with interest in value-added modeling as a way to estimate the effectiveness of schools and especially teachers. Value-added models provide useful information, but that information is error-prone and has a number of other important limitations.
Koretz, D. (2008). A measured approach. American Educator, 32(2), 18-39.
This study evaluated the extent to which the large performance gains shown on KIRIS represented real improvements in student learning rather than inflation of scores.
Koretz, D. M., & Barron, S. I. (1998). The validity of gains on the Kentucky instructional results information system. KIRIS). Santa Monica: RAND.
This study review the empirical literature on teacher coaching and conduct meta-analyses to estimate the mean effect of coaching programs on teachers’ instructional practice and students’ academic achievement.
Kraft, M. A., Blazar, D., & Hogan, D. (2018). The effect of teacher coaching on instruction and achievement: A meta-analysis of the causal evidence. Review of Educational Research, 88(4), 547-588.
The authors conducted a comprehensive review of research to identify the impact of coaching on changes in preservice and in-service teachers’ implementation of evidence-based practices.
Kretlow, A. G., & Bartholomew, C. C. (2010). Using coaching to improve the fidelity of evidence-based practices: A review of studies. Teacher Education and Special Education, 33(4), 279-299.
This study examined the effects of in-service plus follow-up coaching on first grade teachers’ accurate delivery of three research-based strategies during math instruction.
Kretlow, A. G., Cooke, N. L., & Wood, C. L. (2012). Using in-service and coaching to increase teachers’ accurate use of research-based strategies. Remedial and Special Education, 33(6), 348-361.
This study examined the effects of in-service support plus coaching on kindergarten teachers’ accurate delivery of group instructional units in math.
Kretlow, A. G., Wood, C. L., & Cooke, N. L. (2011). Using in-service and coaching to increase kindergarten teachers’ accurate delivery of group instructional units. The Journal of Special Education, 44(4), 234-246.
The seven guiding principles in this manuscript offer research-based directions for literacy coaching.
L’Allier, S., Elish-Piper, L., & Bean, R. M. (2011). What matters for elementary literacy coaching? Guiding principles for instructional improvement and student achievement. The Reading Teacher, 63,544-554. doi: 10.1598/RT.63.7.2
This paper challenge the “more is better” assumption and propose that frequent feedback can overwhelm an individual’s cognitive resource capacity, thus reducing task effort and producing an inverted-U relationship with learning and performance over time.
Lam, C. F., DeRue, D. S., Karam, E. P., & Hollenbeck, J. R. (2011). The impact of feedback frequency on learning and task performance: Challenging the “more is better” assumption. Organizational Behavior and Human Decision Processes, 116(2), 217-228.
The purpose of this study was to examine the validity of teacher evaluation scores that are derived from an observation tool, adapted from Danielson's Framework for Teaching, designed to assess 22 teaching components from four teaching domains.
Lash, A., Tran, L., & Huang, M. (2016). Examining the Validity of Ratings from a Classroom Observation Instrument for Use in a District's Teacher Evaluation System. REL 2016-135. Regional Educational Laboratory West.
Three questions are addressed. First, what are the principles behind creating optimal teacher incentives, and how close do the actual structures in Sweden and the US conform to the ideal ones? Second, how much is performance affected by creating incentives for current teachers, and how much by changing the pool of teacher applicants? Third, do teacher preferences align with those of their students and of society in general, and if not, why not? Associated with each of these questions are policy implications that may remedy existing distortions.
Lazear, E. P. (2003). Teacher incentives. Swedish Economic Policy Review, 10(2), 179-214.
Using longitudinal data from a cohort of middle school students from a large school district,
we estimate separate “value‐added” teacher effects for two subscales of a mathematics
assessment under a variety of statistical models varying in form and degree of control for
student background characteristics.
Lockwood, J. R., McCaffrey, D. F., Hamilton, L. S., Stecher, B., Le, V. N., & Martinez, J. F. (2007). The sensitivity of value‐added teacher effect estimates to different mathematics achievement measures. Journal of Educational Measurement, 44(1), 47-67.
the present study was designed to learn more about how to strengthen the integrity of the problem-solving process
Lundahl, A. A. (2010). Effects of Performance Feedback and Coaching on the Problem-Solving Process: Improving the Integrity of Implementation and Enhancing Student Outcomes. ProQuest LLC. 789 East Eisenhower Parkway, PO Box 1346, Ann Arbor, MI 48106.
This study demonstrates mathematically that the use of such “construct-shifting” vertical scales in longitudinal, value-added models introduces remarkable distortions in the value-added estimates of the majority of educators
Martineau, J. A. (2006). Distorting value-added: The use of longitudinal, vertically scaled student achievement data for growth-based, value-added accountability. Journal of Educational and Behavioral Statistics, 31(1), 35–62.
Over one year, the author asked more than 3,000 educators their opinions about these two basic purposes by presenting them with a scale that has five values.
Marzano, R. J. (2012). Teacher Evaluation: What’s fair? What’s effective? The two purposes of teacher evaluation. Educational Leadership, 70(3), 14–19. Alexandria, VA: ASCD.
The authors show school and district-level administrators how to set the priorities and support the practices that will help all teachers become expert teachers. Their five-part framework is based on what research tells us about how expertise develops.
Marzano, R. J., Frontier, T., & Livingston, D. (2011). Effective supervision: Supporting the art and science of teaching. Ascd.
Building on the analysis that was first reported in School Leadership That Works, the authors of Balanced Leadership identify the 21 responsibilities associated with effective leadership and show how they relate to three overarching responsibilities:
Marzano, R. J., Waters, T., & McNulty, B. A. (2001). School leadership that works: From research to results. ASCD.
This study examines the effect of a comprehensive literacy-coaching program focused on enacting a discussion-based approach to reading comprehension instruction (content-focused coaching [CFC]) on the quality of classroom text discussions over 2 years.
Matsumura, L. C., Garnier, H.E., Spybrook, J. (2012). The effect of content-focused coaching on the quality of classroom text discussions. Journal of Teacher Education, 63,214-228.
In this brief, we discuss what is and is not known about how well value‐added measures level the playing field for teachers by controlling for student characteristics.
McCaffrey, D. F. (2012). Do value-added methods level the playing field for teachers. Carnegie Knowledge Network.
In this brief, we discuss the challenges of using value-added to evaluate teachers of students with disabilities.
McCaffrey, D. F., & Buzick, H. (2014). Is value-added accurate for teachers of students with disabilities. Carnegie Knowledge Network Brief, (14).
The current study extends recent value-added modeling approaches for longitudinal student achievement data Lockwood et al. [J. Educ. Behav. Statist. 32 (2007) 125–150] to allow data to be missing not at random via random effects selection and pattern mixture models, and applies those methods to data from a large urban school district to estimate effects of elementary school mathematics teachers.
McCaffrey, D. F., & Lockwood, J. R. (2011). Missing data in value-added modeling of teacher effects. The Annals of Applied Statistics, 773-797.
Value added modeling has become of interest to policymakers interested in evaluating teacher performance. The authors argue that the models work well when the schools in the sample are homogenous but as heterogeneity of the student population increases estimates of teacher effects are likely to confounded.
McCaffrey, D. F., Lockwood, J. R., Koretz, D. M., & Hamilton, L. S. (2003). Evaluating Value-Added Models for Teacher Accountability. Monograph. ERIC. Retrieved from http://eric.ed.gov/?id=ED529961
This study examines implementation of alternative student growth measures in a sample of eight school districts that were early adopters of the measures. It builds on an earlier Region al Educational Laboratory Mid-Atlantic report that described the two types of alternative student growth measures—alternative assessment–based value-added models and student learning objectives—in the early-adopting districts.
McCullough, M., English, B., Angus, M. H., & Gill, B. (2015). Alternative student growth measures for teacher evaluation: Implementation experiences of early-adopting districts (No. 8a9dfcb1bc6143608448114ea9b69d06). Mathematica Policy Research.
In December 2016, Bellwether Education Partners and The Thomas B. Fordham Institute independently released two reports centered on teacher evaluation and its consequences. Both reports offer a glimpse into ongoing challenges and opportunities with teacher evaluation reform, but they have very different analyses.
McDougald, V., Griffith, D., Pennington, K., & Mead, S. (2016). What is the purpose of teacher evaluation today? A conversation between Bellwether and Fordham. Retrieved from https://edexcellence.net/articles/what-is-the-purpose-of-teacher-evaluation-today-a-conversation-between-bellwether-and
This study documents the implementation of research-based strategies to minimize the occurrence of reading difficulties in a first-grade population. Three strategies were implemented.
Menzies, H. M, Mahdavi, J. N., & Lewis, J. L. (2008). Early intervention in reading: From research to practice. Remedial and Special Education, 29(2), 67-77.
This study examined the effect of a visual performance feedback intervention (i.e., a simple, computer-generated line graph) on teachers' rate of praise for students' academic and behavioral performance and subsequent changes in students' rates of problem behavior.
Mesa, J., Lewis-Palmer, T., & Reinke, W. (2005). Providing Teachers with Performance Feedback on Praise to Reduce Student Problem Behavior. Beyond Behavior, 15(1), 3-7.
This paper summarizes validity evidence pertaining to several different implementations of the Framework. It is based primarily on reviewing the published and unpublished studies that have looked at the relationship between teacher evaluation ratings made using systems based on the Framework and value-added measures of teacher effectiveness.
Milanowski, A. T. (2011). Validity Research on Teacher Evaluation Systems Based on the Framework for Teaching. Online Submission.
This paper reports on the results of the analysis of an additional year of evaluation and student achievement data at some research.
Milanowski, A. T., Kimball, S. M., & White, B. (2004). The Relationship Between Standards-Based Teacher Evaluation Scores and Student Achievement: Replication and Extensions at Three Sites Consortium for Policy Research in Education (CPRE)-University of Wisconsin Working Paper Series. TC, 4(01).
This study evaluated the effects of performance feedback on the implementation of a classroom intervention.
Mortenson, B. P., & Witt, J. C. (1998). The use of weekly performance feedback to increase teacher implementation of a prereferral academic intervention. School Psychology Review, 613-627.
The purpose of this study was to investigate the effects of group inservice training plus written and verbal feedback on four Head Start teachers’ use of incidental teaching. D
Mudd, J. M., & Wolery, M. (1987). Training head start teachers to use incidental teaching. Journal of the Division for Early Childhood, 11(2), 124-134.
How important are teenagers' cognitive skills in predicting subsequent labor market success? Do cognitive skills pay off in the labor market only for students who go to college? Does college benefit only students who enter with strong basic skills? These questions are often part of current policy debates about how to improve the earnings prospects for young Americans.
Murnane, R. J., Willett, J. B., Duhaldeborde, Y., & Tyler, J. H. (2000). How important are the cognitive skills of teenagers in predicting subsequent earnings?. Journal of Policy Analysis and Management, 19(4), 547-568.
This report is concerned with only one of the many causes and dimensions of the problem, but it is the one that undergirds American prosperity, security, and civility.
National Commission on Excellence in Education. (1983). A Nation at Risk: The imperative for education reform. Retrieved from https://www.edreform.com/wp-content/uploads/2013/02/A_Nation_At_Risk_1983.pdf
This report present the panel’s conclusions, an indication of the readiness for application in the classroom of the results of this research, and, if appropriate, a strategy for rapidly disseminating this information to facilitate effective reading instruction in the schools.
National Reading Panel. (2000). Report of the National Reading Panel. Teaching children to read: An evidence-based assessment of the scientific research literature on reading and its implications for reading instruction: Reports of the subgroups(NIH Publication No. 00-4754). Washington, DC: U. S. Government Printing Office.
This study examines the impact of 2 forms of professional development on prekindergarten teachers' early language and literacy practice: coursework and coaching.
Neuman, S. B., & Wright, T. S. (2010). Promoting language and literacy development for early childhood educators: A mixed-methods study of coursework and coaching. Elementary School Journal, 11,63-86. No Child Left Behind Act of 2001, P.L. 107-110, 20 U.S.C. § 6319 (2002).
The goal of the study was to examine the effects of coaching or professional development coursework on teacher knowledge and teacher practice.
Neuman, S. B., & Wright, T. S. (2010). Promoting language and literacy development for early childhood educators: A mixed-methods study of coursework and coaching. Elementary School Journal, 111,63–86.
No Child Left Behind Act of 2001 ESEA Reauthorization
No child left behind act of 2001. Publ. L, 107-110. (2002)
This study evaluated the impact of training on treatment integrity. After finding that positive effects lasted 2-4 days, performance feedback was used to increase treatment integrity.
Noell, G. H., Witt, J. C., Gilbertson, D. N., Ranier, D. D., & Freeland, J. T. (1997). Increasing teacher intervention implementation in general education settings through consultation and performance feedback. School Psychology Quarterly, 12(1).
This research use data from a four-year experiment in which teachers and students were randomly assigned to classes to estimate teacher effects on student achievement.
Nye, B., Konstantopoulos, S., & Hedges, L. V. (2004). How large are teacher effects? Educational Evaluation and Policy Analysis, 26(3),237–257.
There is increased interest in extending the test-based evaluation framework in K-12 education to achievement in high school. High school achievement is typically measured by performance on end-of-course exams (EOCs), which test course-specific standards in subjects including algebra, biology, English, geometry, and history, among others. Recent research indicates that when students take particular courses can have important consequences for achievement and subsequent outcomes. The contribution of the present study is to develop an approach for modeling EOC test performance regarding the timing of course.
Parsons, E., Koedel, C., Podgursky, M., Ehlert, M., & Xiang, P. B. (2015). Incorporating end-of-course exam timing into educational performance evaluations. Journal of Research on Educational Effectiveness, 8(1), 130-147.
As states and districts consider potential changes to their teacher evaluation systems and policies, this paper seeks to inform those efforts by reviewing the evolution of the teacher evaluation policy movement over the last several years, identifying positive outcomes of new systems and negative consequences, and describing risks that should be considered.
Pennington, K., & Mead, S. (2016). For good measure? Teacher evaluation policy in the ESSA era. Washington, DC: Bellwether Education Partners. Retrieved from https://bellwethereducation.org/publication/good-measure-teacher-evaluation-policy-essa-era
This handbook advocates a new approach to teacher evaluation as a cooperative effort undertaken by a group of professionals.
Peterson, K. D. (2000). Teacher evaluation: A comprehensive guide to new directions and practices. Corwin Press.
In this provocative and persuasive new book, the author asserts that the secret to high performance and satisfaction-at work, at school, and at home—is the deeply human need to direct our own lives, to learn and create new things, and to do better by ourselves and our world.
Pink, D. H. (2011). Drive: The surprising truth about what motivates us. Penguin.
This article is the first to explore the extent to which teachers’ instructional alignment is associated with their contributions to student learning and their effectiveness on new composite evaluation measures using data from the Bill & Melinda Gates Foundation’s Measures of Effective Teaching study.
Polikoff, M. S, & Porter, A. C. (2014). Instructional alignment as a measure of teacher quality. Education Evaluation and Policy Analysis, 64(3), 212–225. Retrieved from http://www.aera.net/Newsroom/Recent-AERA-Research/Instructional-Alignment-as-a-Measure-of-Teaching-Quality
Effects of a 1-semester professional development (PD) intervention that included expert coaching with Head Start teachers were investigated in a randomized controlled trial with 88 teachers and 759 children.
Powell, D. R., Diamond, K. E., Burchinal, M. R., & Koehler, M. J. (2010). Effects of an early literacy professional development intervention on Head Start teachers and children. Journal of Educational Psychology, 102, 299-312.
Research using student scores on standardized tests confirms the common perception that some teachers are more effective than others. It also reveals that being taught by an effective teacher has important consequences for student achievement. The best way to assess a teacher's effectiveness is to look at his or her on-the-job performance.
RAND Education. (2012).Teachers matter: Understanding teachers’ impact on student achievement, Santa Monica, Calif.: Author. Retrieved from https://www.rand.org/pubs/corporate_pubs/CP693z1-2012-09.html
This book focuses on helping K-12 teachers increase their use of classroom management strategies that work. The Classroom Check-Up is a step-by-step model for assessing teachers' organizational, instructional, and behavior management practices; helping them develop a menu of intervention options; and overcoming obstacles to change.
Reinke, W. M., Lewis-Palmer, T., & Martin, E. (2007). The effect of visual performance feedback on teacher behavior-specific praise. Behavior Modifications, 31(3), 247–263.
This study evaluted the impact of coaching on the implementation of an intervention. Coaching with higher rates of performance feedback resulted in the highest level of treatment integrity.
Reinke, W., Stormont, M., Herman, K., & Newcomer, L. (2014). Using Coaching to Support Teacher Implementation of Classroom-based Interventions. Journal of Behavioral Education, 23(1), 150-167.
This paper disentangles the impact of schools and teachers in influencing achievement with special attention given to the potential problems of omitted or mismeasured variables and of student and school selection.
Rivkin, S. G., Hanushek, E. A., & Kain, J. F. (2005). Teachers, schools, and academic achievement. Econometrica, 73(2), 417-458.
In this study, using mixed methods, we investigated the longer term effects of eCoaching through advanced online bug-in-ear (BIE) technology.
Rock, M. L., Schumacker, R. E., Gregg, M., Howard, P. W., Gable, R. A., & Zigmond, N. (2014). How are they now? Longer term effects of e coaching through online bug-in-ear technology. Teacher Education and Special Education, 37(2), 161-181.
In order to provide accurate estimates of how much teachers affect the achievement of their students, this study used panel data covering over a decade of elementary student test scores and teacher assignment in two contiguous New Jersey school districts.
Rockoff, J. E. (2004). The impact of individual teachers on student achievement: Evidence from panel data. American economic review, 94(2), 247-252.
This research considers relationships between student achievement (knowledge and cognitive skill), teacher efficacy (Gibson & Dembo, 1984), and interactions with assigned coaches (self-report measures) in a sample of 18 grade 7 and 8 history teachers in 36 classes implementing a specific innovation with the help of 6 coaches.
Ross, J. A. (1992). Teacher efficacy and the effects of coaching on student achievement. Canadian Journal of Education, 17(1), 51–65.
The author develop falsification tests for three widely used VAM specifications, based on the idea that future teachers cannot influence students' past achievement.
Rothstein, J. (2010). Teacher quality in educational production: Tracking, decay, and student achievement. The Quarterly Journal of Economics, 125(1), 175-214.
The author develop falsification tests for three widely used VAM specifications, based on the idea that future teachers cannot influence students' past achievement.
Rothstein, J. (2010). Teacher quality in educational production: Tracking, decay, and student achievement. The Quarterly Journal of Economics, 125(1), 175-214.
Most children with autism rely on schools as their primary source of intervention, yet research has suggested that teachers rarely use evidence-based practices. To address the need for improved educational outcomes, a previously tested consultation intervention called the Collaborative Model for Promoting Competence and Success was evaluated in a 2nd randomized controlled trial, with the addition of a web-based group.
Ruble, L. A., McGrew, J. H., Toland, M. D., Dalrymple, N. J., & Jung, L. (2013). A randomized controlled trial of COMPASS web-based and face-to-face teacher coaching in autism. Journal of Consulting and Clinical Psychology, 81, 566-572.
Using data from 35 seventh-grade teachers and 2,026 students across seven schools, we employ VA methods to measure teacher contributions to students’ motivational orientations (mastery and performance achievement goals) and their mathematics performance.
Ruzek, E. A., Domina, T., Conley, A. M., Duncan, G. J., & Karabenick, S. A. (2015). Using value-added models to measure teacher effects on students’ motivation and achievement. The Journal of Early Adolescence, 35(5-6), 852-882.
In this article, we describe and report on the results of a study in Texas that tested 2 models of professional development for classroom teachers as a way of improving their practices and increasing the reading achievement of their students.
Sailors, M., & Price, L. (2010). Professional development for cognitive reading strategy instruction. Elementary School Journal, 110,301-323.
The Tennessee Value-Added Assessment System determines the effectiveness of school systems, schools, and teachers based on student academic growth over time.
Sanders, W. L., & Horn, S. P. (1998). Research findings from the Tennessee Value-Added Assessment System (TVAAS) database: Implications for educational evaluation and research. Journal of Personnel Evaluation in Education, 12(3), 247-256.
The Tennessee Value-Added Assessment System determines the effectiveness of school systems, schools, and teachers based on student academic growth over time. Research conducted utilizing data from the TVAAS database has shown that race, socioeconomic level, class size, and classroom heterogeneity are poor predictors of student academic growth. Rather, the effectiveness of the teacher is the major determinant of student academic progress.
Sanders, W. L., & Rivers, J. C. (1996). Cumulative and residual effects of teachers on future student academic achievement.
This study examined the relative magnitude of teacher effects on student achievement while simultaneously considering the in¯uences of intraclassroom heterogeneity, student achievement level, and class size on academic growth.
Sanders, W. L., Wright, S. P., & Horn, S. P. (1997). Teacher and classroom context effects on student achievement: Implications for teacher evaluation. Journal of Personnel Evaluation and Education, 11(1), 57–67.
This paper proposes a conceptual framework to analyze teacher evaluation. It elaborates on the main components of a comprehensive teacher evaluation model and explains the main aspects to be taken into account for designing a teacher evaluation model.
Santiago, P., & Benavides, F. (2009). Teacher evaluation: A conceptual framework and examples of country practices.Organisation for Economic Cooperation and Development (OECD). Retrieved from http://www.oecd.org/education/school/44568106.pdf
Sawchuk, S. (2015). Teacher evaluation: An issue overview. Education Week, 35(3), 1-6.
Sawchuk, S. (2015). Teacher evaluation: An issue overview. Education Week, 35(3), 1-6.
Teacher evaluation can be a very sensitive topic for teachers and program administrators alike. Evaluations need to be fair and relevant to both teachers and programs.
Sayavedra, M. (2014). Teacher evaluation. ORTESOL Journal, 31, 1–9.
This paper reviews the empirical research on the effectiveness of performance feedback as a means of influencing teacher implementation of interventions.
Scheeler, M. C., Ruhl, K. L., & McAfee, J. K. (2004). Providing Performance Feedback to Teachers: A Review. Teacher Education & Special Education, 27(4).
This paper evaluated a program for training 4 support staff to embed instruction within the existing activities of 5 children with disabilities in an inclusive preschool.
Schepis, M. M., Reid, D. H., Ownbey, J., & Parsons, M. B. (2001). Training support staff to embed teaching within natural routines of young children with disabilities in an inclusive preschool. Journal of applied behavior analysis, 34(3), 313-327.
The purpose of this study was to examine the effects of multilevel support on first-grade teachers' accurate use of research-based strategies during beginning reading instruction and the extent to which teachers maintained use of these strategies.
Schnorr, C. I. (2013). Effects of multilevel support on first-grade teachers' use of research-based strategies during beginning reading instruction (Doctoral dissertation, The University of North Carolina at Charlotte).
This book is organized around four dominant interrelated core issues: professional standards, a guide to applying the Joint Committee's Standards, ten alternative models for the evaluation of teacher performance, and an analysis of these selected models.
Shinkfield, A. J., & Stufflebeam, D. L. (2012). Teacher evaluation: Guide to effective practice (Vol. 41). Springer Science & Business Media.
This article describe teachers coaching teaching including the purpose, process, who should coach, and the effects of the coaching.
Showers, B. (1985). Teachers coaching teachers. Educational leadership, 42(7), 43-48.
The current study extracted and aggregated data from single-case studies that used Performance feedback (PF) in school settings to increase teachers' use of classroom-based interventions.
Solomon, B. G., Klein, S. A., & Politylo, B. C. (2012). The effect of performance feedback on teachers' treatment integrity: A meta-analysis of the single-case literature. School Psychology Review, 41(2).
This paper presents the results of a rigorous experiment examining the impact of pay for performance on student achievement and instructional practice.
Springer, M. G., Ballou, D., Hamilton, L., Le, V. N., Lockwood, J. R., McCaffrey, D. F., ... & Stecher, B. M. (2011). Teacher Pay for Performance: Experimental Evidence from the Project on Incentives in Teaching (POINT). Society for Research on Educational Effectiveness.
In his 1964 paper, "Course Improvement through Evaluation, " Lee Cronbach urged another step: a most generous inclusion of behavioral - science variables in order to examine the possible causes and effects of quality teaching He proposed that the main objective for evaluation is to uncover durable relationships -those appropriate for guiding future educational programs.
Stake, R. E. (1967). The countenance of educational evaluation. Teachers College Record, 68, 523–540.Retrieved from http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.543.5561&rep=rep1&type=pdf
Peer observation aims to support the sharing of practice, and builds self-awareness about the impact of one's teaching practice in order to affect change.
State Government of Victoria, Australia. (2019). Teacher tip: Peer observation, feedback and reflection.Retrieved from https://www.education.vic.gov.au/school/teachers/classrooms/Pages/approachesppnpeerobstip.aspx
Summative assessment is an appraisal of learning at the end of an instructional unit or at a specific point in time. It compares student knowledge or skills against standards or benchmarks. Summative assessment includes midterm exams, final project, papers, teacher-designed tests, standardized tests, and high-stakes tests.
States, J., Detrich, R. & Keyworth, R. (2018). Overview of Summative Assessment. Oakland, CA: The Wing Institute. https://www.winginstitute.org/assessment-summative
Bill & Melinda Gates Foundation launched the Intensive Partnerships for Effective Teaching initiative. The initiative's goal is dramatic gains in student achievement, graduation rates, and college-going, especially for LIM students.
Stecher, B. M., Garet, M. S., Hamilton, L. S., Steiner, E. D., Robyn, A., Poirier, J., ... & de los Reyes, I. B. (2016). Improving Teaching Effectiveness: Implementation: The Intensive Partnerships for Effective Teaching Through 2013–2014. Rand Corporation.
the authors examine how the five profiled systems are addressing assessment quality, evaluating teachers in nontested subjects and grades, and assigning teachers responsibility for particular students. The authors also examine what is and is not known about the quality of various student performance measures used by school systems.
Steele, J. L., Hamilton, L. S., & Stecher, B. M. (2010). Incorporating Student Performance Measures into Teacher Evaluation Systems. Technical Report. Rand Corporation.
the authors examine how the five profiled systems are addressing assessment quality, evaluating teachers in nontested subjects and grades, and assigning teachers responsibility for particular students. The authors also examine what is and is not known about the quality of various student performance measures used by school systems.
Steele, J. L., Hamilton, L. S., & Stecher, B. M. (2010). Incorporating Student Performance Measures into Teacher Evaluation Systems. Technical Report. Rand Corporation.
Chicago Public Schools initiated the Excellence in Teaching Project, a teacher evaluation program designed to increase student learning by improving classroom instruction through structured principal–teacher dialogue.
Steinberg, M. P., & Sartain, L. (2015). Does teacher evaluation improve school performance? Experimental evidence from Chicago’s Excellence in Teaching project. Education Finance and Policy, 10(4), 535–572.
The purpose of this special issue is to address current issues, challenges, and promising approaches for providing Tier 2 behavioral interventions in school settings. Articles solicited for this issue address gaps in the literature and implementation needs and challenges specifically for Tier 2.
Stormont, M., & Reinke, W. M. (2013). Implementing Tier 2 social behavioral interventions: Current issues, challenges, and promising approaches. Journal of Applied School Psychology, 29(2), 121-125.
In this article, the authors describe key features of the multi-tiered support (MTS) continuum of intervention and assessment and present a case study to illustrate implementation of some components of the framework with four middle school teachers.
Sugai, G. (2014). Multitiered support framework for teachers’ classroom-management practices: Overview and case study of building the triangle for teachers. Journal of Positive Behavior Interventions, 16(3), 179-190.
This study has 2 purposes: examine the effect of an observation-feedback intervention on the rate of a teacher's behavior-specific praise of students with emotional and behavioral disorders (EBD) and the effect of increased rates of a teacher's behavior-specific praise on the on-task behavior of a class of students with EBD.
Sutherland, K. S., Wehby, J. H., & Copeland, S. R. (2000). Effect of varying rates of behavior-specific praise on the on-task behavior of students with EBD. Journal of Emotional and Behavioral Disorders, 8(1), 2-8.
This study explored the effects of visual performance feedback (VPF) delivered in real-time using screen sharing technology on a discrete teacher practice (i.e., positive feedback) for four general education teachers in a middle school using a multiple baseline across teachers design.
Sweigart, C. A., Landrum, T. J., & Pennington, R. C. (2015). The effect of real-time visual performance feedback on teacher feedback: A preliminary investigation. Education and Treatment of Children, 38(4), 429-450.
This study examined the effectiveness of a classroom teacher intervention, the Targeted Reading Intervention (TRI), in helping struggling readers in kindergarten and first grade. This intervention used biweekly literacy coaching in the general education classroom to help classroom teachers use diagnostic strategies with struggling readers in one-on-one 15-min sessions.
Targeted reading intervention: A coaching model to help classroom teachers with struggling readers. Learning Disability Quarterly, 35, 102-114.
This paper offers evidence that evaluation can shift the teacher effectiveness distribution through a different mechanism: by improving teacher skill, effort, or both in ways that persist long-run.
Taylor, E. S., & Tyler, J. H. (2012). The effect of evaluation on teacher performance. American Economic Review, 102(7), 3628-51.
This paper offers evidence that evaluation can shift the teacher effectiveness distribution through a different mechanism: by improving teacher skill, effort, or both in ways that persist long-run.
Taylor, E. S., & Tyler, J. H. (2012). The effect of evaluation on teacher performance. American Economic Review, 102(7), 3628-51.
In the research reported here, the authors study one approach to teacher evaluation: practice-based assessment that relies on multiple, highly structured classroom observations conducted by experienced peer teachers and administrators.
Taylor, E. S., & Tyler, J. H. (2012a). Can teacher evaluation improve teaching? Evidence of systematic growth in the effectiveness of mid-career teachers. Education Next, 12(4), 79–84. Retrieved from http://educationnext.org/can-teacher-evaluation-improve-teaching/
This report proposes six design standards that any rigorous and fair teacher evaluation system should meet. It offers a blueprint for better evaluations that can help every teacher succeed in the classroom—and give every student the best chance at success.
The New Teacher Project. (2010). Teacher Evaluation 2.0.New York, NY: Author.
"The Mirage" describes the widely held perception among education leaders that they already know how to help teachers improve, and that they could achieve their goal of great teaching in far more classrooms if they just applied what they knew more widely.
TNTP. (2015). The Mirage: Confronting the truth about our quest for teacher development. Retrieved from: https://tntp.org/publications/view/the-mirage-confronting-the-truth-about-our-quest-for-teacher-development
The authors examine the causes and consequences of the status of teacher evaluation and its implications for the current national debate about performance pay for teachers. The report also examines a number of national, state, and local evaluation systems that offer potential alternatives to current practice.
Toch, T., & Rothman, R. (2008). Rush to Judgment: Teacher Evaluation in Public Education. Education Sector Reports. Education Sector.
The authors examine the causes and consequences of the status of teacher evaluation and its implications for the current national debate about performance pay for teachers. The report also examines a number of national, state, and local evaluation systems that offer potential alternatives to current practice.
Toch, T., & Rothman, R. (2008). Rush to Judgment: Teacher Evaluation in Public Education. Education Sector Reports. Education Sector.
The authors examine the causes and consequences of the status of teacher evaluation and its implications for the current national debate about performance pay for teachers. The report also examines a number of national, state, and local evaluation systems that offer potential alternatives to current practice.
Toch, T., & Rothman, R. (2008). Rush to Judgment: Teacher Evaluation in Public Education. Education Sector Reports. Education Sector.
In this study, the results of five training studies evaluating the effects of a coaching program for use in Dutch primary and secondary schools are described.
Veenman, S, & Denessen, E. (2001). The coaching of teachers: Results of five training studies.
Educational Research and Evaluation, 7(4), 385–417.
This study evaluated whether the Targeted Reading Intervention (TRI), a classroom teacher professional development program delivered through webcam technology literacy coaching, could provide rural classroom teachers with the instructional skills to help struggling readers progress rapidly in early reading.
Vernon-Feagans, L., Kainz, K., Hedrick, A., Ginsberg, M., & Amendum, S. (2013). Live webcam coaching to help early elementary classroom teachers provide effective literacy instruction for struggling readers: The Targeted Reading Intervention. Journal of Educational Psychology, 105(4), 1175.
This report examines the pervasive and longstanding failure to recognize and respond to variations in the effectiveness of teachers.
Weisberg, D., Sexton, S., Mulhern, J., Keeling, D., Schunck, J., Palcisco, A., & Morgan, K. (2009). The widget effect: Our national failure to acknowledge and act on differences in teacher effectiveness. New Teacher Project.
Wenglinsky, H. (2002). How schools matter: The link between teacher classroom practices and student academic performance. Education Policy Analysis Archives, 10(12).
U.S. public policy has increasingly been conceived, debated, and evaluated through the lenses of politics and ideology. The fundamental question -- Will the policy work? -- too often gets short shrift or even ignored. A remedy is an evidence-based policy--a rigorous approach that draws on careful data collection, experimentation, and both quantitative and qualitative analysis to determine what the problem is, which ways it can be addressed, and the probable impacts of each of these ways.
Wesley, P. W., & Buysse, V. (2006). Making the case for evidence- based policy. In V. Buysse & P. W. Wesley (Eds.), Evidence-based practice in the early childhood field (pp. 117–159). Washington, DC: Zero to Three.
A preliminary survey of 32 school districts identified as having highly developed teacher evaluation systems was followed by the selection of 4 case study districts.
Wise, A. E., Darling-Hammond, L., Tyson-Bernstein, H, & McLaughlin, M. W. (1984). Teacher evaluation: A study of effective practices. Santa Monica, CA: RAND Corporation. Retrieved from https://files.eric.ed.gov/fulltext/ED246559.pdf
Due to the increased need to support teachers' use of evidence-based practices in multi-tiered systems of support such as RTI [Response to Intervention] and PBIS [Positive Behavior Interventions and Support], coaching can extend and strengthen professional development. This paper describes a multi-level approach to coaching and provides implications for practice and research.
Wood, C. L., Goodnight, C. I., Bethune, K. S., Preston, A. I., Cleaver, S. L. (2016). Role of professional development and multi-level coaching in promoting evidence-based practice in education. Learning Disabilities: A Contemporary Journal, 14,159-170.
The purpose of this study is to examine research to answer the question, What is the impact of teacher professional development on student achievement.
Yoon, K. S., Duncan, T., Lee, S. W. Y., Scarloss, B., & Shapley, K. L. (2007). Reviewing the Evidence on How Teacher Professional Development Affects Student Achievement. Issues & Answers. REL 2007-No. 033. Regional Educational Laboratory Southwest (NJ1).
This second edition includes an expanded set of classroom observation tools, moving from 23 to 40 and more linkages to the job-embedded nature of the informal classroom observations.
Zepeda, S. J. (2009). The instructional leaders’ guide to informal classroom observations.New York, NY: Routledge.