A New Review Finds Validity Problems Undermine Third-Grade Retention Report

East lansing, Mich. – As part of its working paper series, the National Bureau of Economic Research (NBER) recently released a report examining the outcomes of Florida’s third-grade retention policy. The report concluded, contrary to the conventional wisdom on grade retention, that third-grade retention had positive effects on the following year’s test results, but the effects fade, with no effect on graduation. Despite this, a new academic review finds several shortcomings that severely limit the report’s usefulness.

The report, The Effects of Test-based Retention on Student Outcomes Over Time: Regression Discontinuity Evidence from Florida, was reviewed by Joseph P. Robinson-Cimpian for the Think Twice think tank review project with funding from the Great Lakes Center for Education Research and Practice. Robinson-Cimpian is an associate professor and College of Education Distinguished Scholar in the Department of Educational Psychology at the University of Illinois at Urbana-Champaign. His research focuses on the use and development of novel and rigorous methods to study equity and policy.

The report attempted to investigate the impact of a Florida policy, which flags students for retention, to repeat third grade, based on a state-specified cut-score on the Florida Comprehensive Achievement Test. The findings indicated that students just below the threshold (one-third of whom were retained) performed better than those just above the threshold (5% of whom were retained) on next year’s tests.

In his review, Robinson-Cimpian notes that the report relies on what is known as a regression discontinuity design (RDD), a technique used for making causal inferences from non-experimental data when a threshold determines or strongly predicts treatment assignment (comparing students immediately above and below the law’s cut-score, lend themselves to making causal claims).

However, Robinson-Cimpian finds serious shortcomings. Most notably, because students above the cut-score do not receive the extra supports provided to students below the cut-score, the researchers cannot know if positive outcomes for those below the cut-score were due to the greater likelihood of retention or to the assurance of additional services.

Additionally, Robinson-Cimpian finds that the report exacerbates the outcome differences between those below and above the threshold by using an Instrumental Variable approach, which attributes the entire difference to just the one-third of students who are retained, effectively making the outcome difference appear more than three times as large. Importantly, Robinson-Cimpian finds the very use of the Instrumental Variable approach is inappropriate because the method assumes that failing to attain the threshold has no effect on outcomes other than through increasing the likelihood of retention.

Overall, Robinson-Cimpian points out, the methods used have extremely limited generalizability, which is restricted to students at or very near the threshold and directly affected by the policy. Even setting aside the problems generated by confounding retention effects with the effects of other interventions and supports, the findings are not easily generalizable to lower- or higher-achieving students, to other grades, or to other states with similar test-based retention policies.

Find the review on the Great Lakes Center website.

Find the original report by Guido Schwerdt and Martin R. West on the web.

Think Twice, a project of the National Education Policy Center, provides the public, policymakers and the press with timely, academically sound reviews of selected publications. The project is made possible by funding from the Great Lakes Center for Education Research and Practice.

The review can also be found on the NEPC website.

Featured

  • Quadient Achieves 25,000 Locker Installations Worldwide

    Global automation platform Quadient recently announced that it has acquired Package Concierge, a U.S.-based parcel management solutions provider, according to a news release. The acquisition pushes Quadient over the 25,000-unit milestone of global installations.

  • New Jersey PreK–12 School Breaks Ground on New STEM Building

    Saddle River Day School (SRDS) in Saddle River, N.J., recently announced that it has broken ground on the new Dr. Kristen Walsh Hall of Science & Entrepreneurship, according to a news release. The school partnered with DIGroup Architecture for the design of the new facility, which will provide the school with space to expand its STEM and business education classes.

  • Boosting Student Wellness and Safety Through Indoor-Outdoor School Spaces

    Engaging students through facilities designed for indoor and outdoor learning and activities reflects a growing awareness of how children learn and thrive, with educators recognizing the importance of getting outside and disconnecting from technology. And, as today’s youth grapple with the urgent mental health crisis of increased anxiety and loneliness fueled by both the pandemic and technology, along with a related crisis in youth physical health, the wellness benefits of getting outside have never been so palpable.

  • OpenStax Celebrates 25th Anniversary of Providing Open Educational Resources

    OpenStax, which expands access of K–12 and higher-education resources and research-informed educational tools, is celebrating its 25th anniversary as 2024 comes to a close, according to a news release. The educational initiative from Rice University has served almost 37 million students in 153 countries and saved students nearly $3 billion in educational costs since its launch in 1999.