Class size: Does it matter for student achievement?

Jepsen, Christopher

doi:10.15185/izawol.190

one-pager full article

Elevator pitch

Numerous economic studies have considered the relationship between class size and student achievement, the majority of which have focused on elementary schools in the US and Europe. While the general finding is that smaller classes are associated with increased student achievement, a few high-quality studies find no relationship. Further, empirical research on the costs and benefits of smaller classes concludes that other education policies, such as tutoring, early childhood programs, or improving teacher quality would be better investments.

Effect of a “one-student” reduction in
class size on test scores, by location

Key findings

Pros

In general, smaller classes are associated with increased student achievement, usually measured by standardized tests in multiple subjects such as mathematics and reading.

There are benefits of smaller class sizes when variation in class size (due to maximum class-size rules) and random fluctuations in population are taken into consideration.

The positive relationship between smaller class size and student achievement holds for students in different grades as well as in different countries and across continents.

Reducing class size is a clear education policy that is popular with students, parents, teachers, and policymakers, and hence easy to argue for.

Cons

Several high-quality studies find no relationship between class size and student achievement.

Reducing class size is a very expensive policy reform relative to other reforms, which may also provide better value.

Because most studies focus on elementary schools, much less is known about the relationship between class size and student achievement in secondary schools.

Reducing class size in many schools in developing countries is unlikely to improve achievement as these schools have more fundamental challenges, such as high teacher absenteeism.

Author's main message

Reducing class size is a popular education policy measure with parents, teachers, and policymakers. However, research shows that reducing class size leads to, in most cases, only modest improvements in student achievement. Also, students in early grades appear to gain more from smaller classes than older students. Despite extensive research on class size, much about this relationship is still unknown. Policymakers should be aware that reducing class sizes can be costly, is no guarantee of improved achievement, and is only one of many possible reforms.

Motivation

Class size is an extremely popular education reform among many stakeholders, including students, parents, teachers, school administrators, and educationalists. With such broad appeal, reducing class size is also popular among policymakers. Intuitively, students in smaller classes should have better learning outcomes than students in larger classes—for example, the teacher can provide more individualized attention in smaller classes, and classroom discipline is easier with fewer students [6].

At the same time, reducing class size is an expensive education policy. For example, a class-size reduction policy instituted in California in 1996 reduced class size from an average of 30 students per class to a maximum of 20 students per class, but as a result increased the number of teachers by approximately half [3]. To illustrate this calculation, consider a grade with 60 students; the legislation would increase the number of teachers from two to three, which is an increase of 50%. Teacher salaries constitute the vast majority of schools’ non-capital expenses, and in the first two years of California’s class-size reduction, over 25,000 additional teachers were hired.

Research indicates that the costs of reducing class size are more likely to exceed the benefits and that other education policies, such as tutoring, early childhood programs, or improving teacher quality, would be better investments [6].

In addition, a simple comparison of achievement across classrooms of different sizes will not reflect the true relationship between class size and student achievement, for a number of reasons. For example, in the US, where the biggest source of funding is local property tax revenue, schools in wealthier areas are more likely to have smaller classes and higher achievement due to students’ more advantaged backgrounds, rather than being a casual effect of smaller classes [6]. In contrast, if a school provides smaller classes for its most “at-risk” students, the result would be higher achievement in the larger classes, again for reasons unrelated to class size.

Therefore, a rigorous analysis of the causal relationship between class size and student achievement is complicated, as the number of students in each classroom is not determined randomly. And while smaller classes are generally associated with higher student achievement, the evidence is not universal.

Discussion of pros and cons

Influential research

It is quite challenging to isolate the effect of class size from other determinants of student achievement, as schools explicitly decide class sizes, and they often base class-size decisions, as well as the assignment of specific students to classes of different sizes, on prior student achievement (i.e. test scores). Consequently, researchers view the class-size experiment conducted in Tennessee in the late 1980s as the highest-quality study on the topic. Over 11,000 students and their teachers were randomly assigned between small classes of 15 students and regular classes of 23 students [6]. If it is assumed that class-size effects are linear (in order to make comparisons between studies with difference changes in class size), then the “per-pupil” effect of the Tennessee experiment was 0.048 standard deviations [6], [7]. In other words, each “one-student” reduction in class size is associated with an increase in student achievement of 0.048 standard deviations.

However, the bulk of research on the relationship between class size and student achievement is based on techniques other than random experiments. Such studies, often called “quasi-experimental” or “non-experimental,” attempt to isolate the effect of class size in several ways. One compelling way, first used in an analysis of Israeli schools, is to focus on small changes in student enrollment that correspond with changes in the number of teachers, thus leading to differences in class size [4]. This approach is often called the “maximum class-size rule” (or Maimonides’ Rule, after the 12th-century scholar who proposed it).

As an example of this, in Israel, the maximum number of students in a classroom is 40. This means that a school with 39 people in a grade has one teacher with a class size of 39, whereas a school with 42 people in a grade has two teachers with an average class size of 21. The underlying idea is that natural fluctuations in area population generate potentially large, and presumably random, changes in average class size. However, researchers and policymakers using this type of analysis should carefully investigate the extent to which these fluctuations in class size appear to be random. The study for Israel is very carefully investigated and shows that smaller classes are associated with sizable improvements in achievement for fifth-grade students, but with smaller effects for fourth-grade students. In terms of a “per-student” reduction, the effect is around 0.036 standard deviations for fifth grade and approximately 0.018 for fourth grade [4].

Another technique that has been frequently used is to study yearly fluctuations in class size, thereby attempting to isolate presumably random changes in class size that occur as a result of student population variation (i.e. students moving into and out of schools and classrooms). This approach typically involves using detailed data on a large number of students over multiple years, as a given grade or school may have very little fluctuation between one year and the next. This approach was first applied to data on students in late elementary grades in Texas [2]. In the fifth grade, a one-student reduction in class size is associated with an increase of 0.0055 standard deviations in mathematics and 0.0043 standard deviations in reading (in their most sophisticated analyses). In the sixth and seventh grades, the authors could not refute the proposition that class size is unrelated to student achievement. This approach has also been used to estimate the effect of class-size reduction in California, with effects of 0.006−0.01 standard deviations for third grade reading and mathematics, respectively [3].

A similar approach looked instead at variations in the population of school-age children, rather than the actual student population. Estimates from this approach are much smaller than those using more detailed student-level data. In Connecticut, for example, there is no evidence that reductions in class size are associated with gains in student achievement, using data from the 1980s and 1990s [1]. The results are similar when using the maximum class-size rule, referred to earlier.

Further evidence in the US

A class-size reduction program was also conducted in Florida. This was undertaken across all grades (as opposed to the Californian study, which focuses on kindergarten through third grade) and had both district-level and school-level components. The results show little, if any, improvement in achievement resulting from the reductions in class size [8].

In Minnesota, researchers used changes in the school-age population (as opposed to the actual school population) to study the relationship between elementary school class size and student achievement. However, over half the schools have missing data on either the school-age population or the class size. Among the schools without missing data, a “one-student” reduction in class size is associated with smaller achievement gains than those found in the most influential studies of class size.

Several studies use state-level datasets with detailed information on class size and student achievement, as well as student demographic information and information on teachers. These data typically cover entire school districts, or even states, such as New York, North Carolina, and San Diego. The data are available for multiple years. With these data, researchers studied the determinants of student achievement in general, often with a focus on teacher characteristics. Almost all of these studies include class size, even if it is not the main focus of the analysis. The majority of these studies find a negative relationship between class size and student achievement as measured by standardized test scores, indicating that the bigger the class size the lower the test scores. In other words, the research typically concludes that smaller classes are associated with higher student achievement.

Studies in Europe

In Europe, the most influential studies use the maximum class-size rule. In Sweden, a “one-student” reduction in class size in grades four to six is associated with an increase in test scores in mathematics and Swedish, at ages 13 and 16, of 0.023−0.033 standard deviations respectively [5]. Similarly, in France, numerous researchers have applied this technique and identified a smaller, positive relationship between smaller classes and student achievement, both in elementary and secondary grades [9]. Two studies in Denmark, using different data sets and statistical techniques, also demonstrate small benefits of reduced class sizes for both test scores and year of schooling.

However, the findings from Europe are far from universal. A study of 11 countries, predominantly in Europe, shows substantial cross-country variation in the relationship between class size and student achievement, with most countries having a small or no benefit from smaller class sizes [10]. In Norway, two studies using the maximum class-size rule obtain different results. One study finds a positive effect of smaller class sizes on student achievement in the early years of secondary school, whereas another study, using similar methods (on a larger data set), essentially finds no effect. Using a variant of the maximum class size based on the government’s school funding formula, researchers were not able to discern a clear relationship between class size and student achievement for fourth, sixth, and eighth grade students in the Netherlands.

Studies outside the US and Europe

Class-size research is rare outside the US and Europe. In Japan, smaller classes are associated with higher achievement in fourth and sixth grades, but there is no evidence of a positive relationship for ninth grade. As of July 2015, there are, to the author’s best knowledge, no studies on class size in Canada, Australia, or New Zealand in the main database of economics literature, .

In developing countries, few high-quality studies of class size exist. One of these high-quality investigations is for Bolivia, which finds a positive relationship between student achievement and smaller classes [11].

In general, the problem of isolating the effect of smaller classes from other factors is more challenging in developing countries than in developed countries. Basic services, such as having a teacher (or even a substitute teacher) are often missing in schools in many locations. If the teacher is not present, then the size of the class is irrelevant.

In Kenya, for example, a reduction in class size from 82 to 44 is not associated with improved achievement, but the use of a locally hired contract teacher (i.e. a teacher who is hired on an annual, renewable contract) is associated with improved achievement. The likely explanation for this is that absenteeism is much lower among contract teachers.

Therefore, in many developing countries, more fundamental issues, such as adequate staffing and facilities, need to be addressed before focusing attention on possible class size effects on student achievement.

Do certain types of students benefit more than others?

The takeaway message from the existing research is that smaller classes are associated with improved student achievement more often than not, though some high-quality studies find no relationship. But what does past research tell us about whether some students benefit more than others from smaller classes?

As with the overall pattern of results, smaller classes do not clearly benefit (or harm) specific groups of students. The studies finding no relationship between class size and achievement generally report analyses for different types of student groups, and none of them finds that any particular group of students would benefit significantly more from smaller classes.

The closest thing to consensus on class-size effects is that when smaller classes are beneficial, they tend to be more beneficial for younger students than for older students. This finding is demonstrated, for example, by empirical research in Texas and Japan.

Some evidence suggests that disadvantaged students receive larger benefits, as shown in the class-size experiment in Tennessee and the maximum class-size rule in Israel [4], [5]. However, this pattern of results is not echoed elsewhere in the literature. In Europe, the effects do not appear to differ much by student demographics such as gender, parental income, race, and ethnicity. In Japan, the benefits associated with smaller class sizes appear to be larger for wealthy students and for class sizes under 20 students.

Cost–benefit analysis

It is important to measure the potential benefits of smaller classes against their costs. Assuming there is no change in enrollment, a reduction in class size corresponds with an increase in the number of classrooms. Thus, the two primary costs of reducing class sizes are the cost of additional teachers and the cost of creating additional classroom space. On this basis, studies from the US suggest that each “one-student” reduction in class size has a cost of $200−250 per pupil [6]. However, it is important to consider that there may also be additional costs involved, such as electricity and other costs of operation.

Another approach in estimating costs is a simple “back-of-the-envelope” calculation, with the assumption that all costs are variable. In other words, a 10% reduction in class size would produce a 10% increase in per-student costs. Under this simplistic approach, the estimated cost of a “one-student” reduction in class size is even higher, in excess of $400 per student [6].

In addition, a comprehensive cost–benefit analysis of smaller classes should include all benefits of smaller classes, including short- as well as longer-term improvements in achievement. These benefits would also include future values for students, such as increased earnings, decreased unemployment, improved health, etc.

US studies that compare costs and benefits usually find that the total benefits of smaller class sizes—not just the benefits associated with improved achievement—do not exceed the costs. Even in the study with the largest effects of smaller classes (the experiment in Tennessee), the benefits are roughly equal to the costs [6], [7]. In other programs, where the perceived benefits are noticeably smaller, the benefits are even lower relative to the costs.

The cost–benefit ratio seems even less favorable in Europe, where the predicted effects of smaller classes are less consistent than in US studies. That being said, however, the study from Sweden argues that the benefits of smaller classes do exceed the costs [5]. This study follows students from school until they are middle-aged. It can therefore directly estimate the increase in wages associated with reduced class sizes. The corresponding estimates show larger wage increases than the indirect effects used elsewhere in the literature; this includes indirect effects measured in Sweden (when the actual wage data are not used). Although other studies do not explicitly model the cost–benefit analysis, the smaller benefits of class size suggest that the benefits do not significantly outweigh the costs. However, the study from Sweden shows that predicting the effects of smaller classes on future wages will produce “estimated” wage gains that are smaller than the “actual” wage gains. It is unclear whether researchers would find a similar relationship in other countries.

The “opportunity cost” of reducing class size

So far, the discussion of costs and benefits has been in terms of monetary costs. An economic cost–benefit analysis, however, would compare the benefits and costs of class size to those of an alternative use of the money [6]. The comparison with the “next-best” alternative use of money is known in economics as the “opportunity cost.” With this in mind, given that the benefits of reducing class size do not exceed the costs in nearly all studies, a rational conclusion would be that smaller class sizes do not exceed the benefits of the next-best alternative: i.e. that the “opportunity cost” is not significant.

For most of the studies reviewed in this contribution, the costs of reducing class size generally exceed the benefits, with the exception of the study from Sweden [5]. Studies looking at the costs and benefits of smaller classes (i.e. the “opportunity cost” of reducing class size) have concluded that other education policies, such as tutoring, early childhood programs, or improving teacher quality, would be better investments [6]. For example, the study of class size and teacher characteristics in Texas concluded that improving teacher quality, such as replacing the most ineffective teachers, would have significantly large returns—much larger than any conceivable class-size reduction program [2]. However, in many countries, including the US, the ability to replace low-quality teachers is complicated by the tenure system, which protects teachers from dismissal; teachers with tenure are quite difficult, if not impossible, to remove from their positions. Alternatively, reassigning ineffective, tenured teachers to non-classroom duties is very expensive.

Limitations and gaps

The Tennessee experiment is considered the “gold standard” for class-size research, in large part because it is the only sizable class-size experiment that has been conducted since the early 20th century. However, even random experiments, such as this, have their limitations. Fewer than half the students randomly assigned to small or regular classes in kindergarten or pre-school are recorded in the data four years later [6], [7]. Hence it is not possible to track students over time. In addition, only schools with at least three classrooms per grade are included in the study, which results in an overrepresentation of urban schools and of schools with sizable non-white student populations. Both are likely to bias results.

Another potential concern is that teachers assigned to smaller classes may exert extra effort to increase the likelihood of being assigned to smaller classes in the future [1], [6]. However, there is evidence that, even among the regular classes, those with slightly smaller class sizes have better student outcomes than those with larger class sizes [6], [7].

Studies using other techniques, such as the maximum class-size rule, also have their limitations. The studies in Sweden and Bolivia are limited to smaller schools, so it is not clear whether class size has similar effects in larger schools in these countries [5], [8]. One study from Japan is based on one year of data (2002) only, while another study is limited to the city of Yokohama. Again, there are concerns about whether the class-size effects are similar in other years and in other parts of Japan.

Another major limitation is the issue of “external validity.” In other words, it is not clear to what extent the results from one study in one location or time period can inform the likely benefits of smaller classes in a different location or time period, or in a school of a different size or in a different country.

To illustrate the point, consider two of the highest-quality studies: the Tennessee experiment and the Israeli study using the maximum class-size rule. In Tennessee, the class-size experiment compared students in “regular” classes of 23 with students in “small” classes of 15. In Israel, the maximum class-size rule is 40 students. Therefore, one of the smallest possible class sizes in Israe l—21 students (say in a grade of 42 students)—would be considered a “regular” class size in Tennessee (or elsewhere, such as Denmark). The few high-quality studies that exist only study a small set of possible changes in class size. Consequently, research provides only little insight on the effectiveness of class size across the possible, or even probable, distribution of class sizes. Researchers simply cannot identify the optimal class size (yet).

Another fundamental gap in the literature is the lack of high-quality studies in secondary schools [6]. Given the above concerns regarding external validity, the studies from primary schools are of little use when evaluating the relationship between class size and student achievement in secondary schools, particularly in the later grades of secondary school (grades 10−12). One challenge in studying class size in secondary school is that, in most locations, students in secondary schools have different teachers for different subjects. For outcomes such as subject-matter test scores, the class size in that subject is the most relevant, but the relevant class-size measure for overall outcomes, such as graduation or grade-point-average, is less clear.

An additional gap in the research is the absence of studies in many locations. Clearly, more high-quality research is needed in places where little, if any, such research is currently available, such as in Germany or Australia. However, identifying the relationship between class size and achievement can be challenging in some settings. For example, many schools in Ireland, particularly in urban areas, have very little variation in class size due to school policies. Consequently, the techniques used elsewhere, such as the maximum class-size rule, are unlikely to work in Ireland, as there may not be enough classrooms with different sizes to compare achievement.

Summary and policy advice

In summary, smaller classes are generally associated with higher student achievement, but the evidence is far from unanimous. The few studies that find no effect of smaller classes use data from similar (if not identical) locations to studies that find positive effects of smaller classes. The studies, by and large, use the same statistical techniques. Thus, advocates of smaller classes cannot simply dismiss the studies finding no effect as being somehow inferior or being confined to particular locations. Instead, the conclusion for policymakers and researchers is that reducing class sizes is no guarantee of improved achievement, even though the majority of past research finds such a positive relationship.

Even in situations where smaller class sizes are associated with improved student achievement, resources may be better spent on other reforms, such as teacher “quality” (as opposed to teacher “quantity”).

In general, the effectiveness of education reforms is difficult—and in some cases virtually impossible—to estimate, especially as these reforms usually overlap. For example, in the late 1990s, California instituted several educational reforms, such as expanded school accountability, in addition to reducing class sizes.

Policymakers must also keep in mind that policies designed to change class size are likely to have unintended consequences. In line with this, the class-size reduction program in California led to a dramatic increase in the number of teaching positions in the state. As a consequence, many teachers in low-performing schools in poor neighborhoods left for newly created positions in higher-quality schools in more affluent areas, leaving the low-performing schools to hire new teachers with less experience (and, presumably, less ability) [3].

Also, in response to strong financial incentives to keep class sizes as small as possible, many schools in California combined students from different grades into the same classroom. These multi-grade classrooms tended to have lower achievement than otherwise similar classrooms with students from only one grade.

So, what should policymakers do regarding class size? Many stakeholders in education, including teachers and parents, strongly believe that smaller classes are better for students. However, policymakers should be aware that reducing class size is an expensive reform that will not automatically increase student achievement, although it will likely please teachers, parents, and students. They should consider class size as only one of many possible reforms. A more holistic approach would be to consider the potential costs and benefits of many possible reforms, with the understanding that these costs and benefits are imprecise at best and completely unknown at worst.

Many parts of the world have little or no information on the effectiveness, or ineffectiveness of smaller classes. For developing countries, schools often face more fundamental challenges, such as teacher absenteeism, that will reduce (if not eliminate) any benefits of smaller classes. In these places, policymakers would be better served to solve the more pressing issues before turning their attention to potential class size effects.

If policymakers wish to understand the relationship between class size and student achievement in their area, then the easiest, quickest, and least costly way to determine that is to begin by studying the data available to them. High-quality data increase, but do not ensure, the likelihood that researchers can identify the past relationship between class size and achievement. The possibility of conducting a class-size experiment, as in Tennessee, is extremely challenging and expensive, and it still may not provide conclusive evidence on the effects of smaller classes, even after years of study.

In sum, the relationship between class size and achievement is not clear. Research suggests that policymakers should be aware that reducing class sizes is no guarantee of improved achievement and is only one of many possible reforms.

Acknowledgments

The author thanks an anonymous referee and the IZA World of Labor editors for many helpful suggestions on earlier drafts. Previous work of the author contains a larger number of background references for the material presented here and has been used intensively in all major parts of this article [7].

Competing interests

The IZA World of Labor project is committed to the IZA Guiding Principles of Research Integrity. The author declares to have observed these principles.

evidence map

Class size: Does it matter for student achievement?

Class size: Does it matter for student achievement?

Smaller classes are often associated with increased achievement, but the evidence is far from universal

Elevator pitch

Key findings

Pros

Cons

Author's main message

Motivation

Discussion of pros and cons

Influential research

Further evidence in the US

Studies in Europe

Studies outside the US and Europe

Do certain types of students benefit more than others?

Cost–benefit analysis

The “opportunity cost” of reducing class size

Limitations and gaps

Summary and policy advice

Acknowledgments

Competing interests

evidence map

The promises and pitfalls of universal early education

The role of preschool in reducing inequality

How effective is compulsory schooling as a policy instrument?

Is teacher certification an effective tool for developing countries?

Understanding teacher effectiveness to raise pupil attainment

Class size: Does it matter for student achievement?

Smaller classes are often associated with increased achievement, but the evidence is far from universal

Elevator pitch

Key findings

Pros

Cons

Author's main message

Motivation

Discussion of pros and cons

Influential research

Further evidence in the US

Studies in Europe

Studies outside the US and Europe

Do certain types of students benefit more than others?

Cost–benefit analysis

The “opportunity cost” of reducing class size

Limitations and gaps

Summary and policy advice

Acknowledgments

Competing interests

evidence map

Full citation

Full citation

Data source(s)

Data type(s)

Method(s)

Countries