How to Read Education Data Without Jumping to Conclusions (Jessica Lahey & Tim Lahey)

In an earlier post, I offered fundamental questions that parents, teachers, administrators, researchers, and policymakers could (and yes, should) ask of any policies being considered for improving classroom teaching and student learning.

In this post, a teacher and M.D. offer the basic questions that educators and non-educators should ask of any research study tweeted, blogged about, and appearing in newspapers or on TV programs.

Jessica Lahey is an English, Latin, and writing teacher in Lyme, New Hampshire. She writes about education and parenting for The New York Times and on her site, Coming of Age in the Middle. Tim Lahey, MD, is an infectious diseases specialist and associate professor of medicine at Dartmouth’s Geisel School of Medicine

This piece appeared on AtlanticOnline, Jul 8 2014

Education has entered the era of Big Data. The Internet is teeming with stories touting the latest groundbreaking studies on the science of learning and pedagogy. Education journalists are in a race to report these findings as they search for the magic formula that will save America’s schools. But while most of this research is methodologically solid, not all of it is ready for immediate deployment in the classroom.

Jessica was reminded of this last week, after she tweeted out an interesting study on math education. Or, rather, she tweeted out what looked like an interesting study on math education, based on an abstract that someone else had tweeted out. Within minutes, dozens of critical response tweets poured in from math educators. She spent the next hour debating the merits of the study with an elementary math specialist, a fourth grade math teacher, and a university professor of math education.

Tracy Zager, the math specialist, and the author of the forthcoming book Becoming the Math Teacher You Wish You’d Had, emailed her concerns about the indiscriminate use of education studies as gospel:

Public education has always been politicized, but we’ve recently jumped the shark. Catchy articles about education circulate widely, for understandable reason, but I wish education reporters would resist the impulse to over-generalize or sensationalize research findings.

While she conceded that education journalists “can’t be expected to be experts in mathematics education, or science education, or literacy education,” she emphasized that they should be held to a higher standard than the average reader. In order to do their jobs well, they should not only be able to read studies intelligently,“they should also consult sources with field-specific expertise for deeper understanding of the fields.”

After she was schooled on Twitter, Jessica called up Ashley Merryman, the author of Nurture Shock: New Thinking About Children, and Top Dog: The Science of Winning and Losing. “Just because something is statistically significant does not mean it is meaningfully significant,” Merryman explained. “The big-picture problem with citing the latest research as a quick fix is that education is not an easy ship to turn around.” When journalists cite a press release describing a study without reading and exploring the study’s critical details, they often end up oversimplifying or overstating the results. Their coverage of education research therefore could inspire parents and policymakers to bring half-formed ideas into classroom. Once that happens, said Merryman, “the time, money, and investment that has gone into that change means we are stuck with it, even if it’s later proven to be ineffective in practice.”

As readers and writers look for solutions to educational woes, here are some questions that can help lead to more informed decisions.

1. Does the study prove the right point?

It’s remarkable how often far-reaching education policy is shaped by studies that don’t really prove the benefit of the policy being implemented. The Tennessee Student Teacher Achievement Ratio (STAR) study is a great example.

In the late 1980s, researchers assigned thousands of Tennessee children in grades K-3 to either standard-sized classes (with teacher-student ratios of 22-to-1) or smaller classes (15-to-1) in the same school and then followed their reading and math performance over time. The landmark STAR study concluded that K-3 kids in smaller classes outperformed peers in larger classes. This led to massive nationwide efforts to achieve smaller class sizes.

A key step is to avoid extrapolating too much from a single study.

Subsequent investigations into optimal class size have yielded more mixed findings, suggesting that the story told in STAR was not the whole story. As it turns out, the math and reading benefits experienced by the K-3 kids in Tennessee might not translate to eighth grade writing students in Georgia, or geography students in Manhattan, or to classes taught using different educational approaches or by differently skilled teachers. A key step in interpreting a new study is to avoid extrapolating too much from a single study, even a well-conducted one like STAR.

2. Could the finding be a fluke?

Small studies are notoriously fluky, and should be read skeptically. Recently Carnegie Mellon researchers looked at 24 kindergarteners and showed that those taking a science test in austere classrooms performed 13 percent better than those in a “highly decorated” setting. The authors hypothesized that distracting décor might undermine learning, and one article in the popular press quoted the researchers as saying they hoped these findings could inform guidelines about classroom décor.

While this result may seem to offer the promise of an easy 13-percent boost in students’ learning, it is critical not to forget that the results may come out completely different if the study were replicated in a different group of children, in a different school, under a different moon. In fact, a systematic review has shown that small, idiosyncratic studies are more likely to generate big findings than well-conducted larger studies. Would that 13 percent gap in student performance narrow in a larger study that controlled for more variables?

In other words, rather than base wide-reaching policy decisions on conclusions derived from 24 kindergarteners, it would seem reasonable, for now, to keep the Jane Austen posters and student art on the classroom wall.

3. Does the study have enough scale and power?

Sometimes education studies get press when they find nothing. For instance, Robinson and Harris recently suggested that parental help with homework does not boost academic performance in kids. In negative studies like these, the million-dollar question is whether the study was capable of detecting a difference in the first place. Put another way, absence of evidence does not equal evidence of absence.

Absence of evidence does not equal evidence of absence.

There are multiple ways good researchers can miss real associations. One is when a study does not have enough power to detect the association. For example, when researchers look for a rare effect in too small a group of children, they sometimes miss the effect that could be seen within a larger sample size. In other cases, the findings are confounded—which means that the factor being studied is affected by some other factor that is not measured. For example, returning to Robinson and Harris, if some parents who help their kids with homework actually do the kids’ homework for them while others give their kids flawed advice that leads them astray, then parental help with homework might appear to have no benefit because the good work of parents who help effectively is cancelled out by other parents’ missteps.

It’s always a good idea to check whether a negative study had enough power and scale to find the association it sought, and to consider whether confounds might have hidden—or generated—the finding.

4. Is it causation, or just correlation?

It turns out that the most important way for parents to raise successful children is buy bookcases. Or at least this is what readers could conclude if they absorbed just the finding summarized in this Gizmodo article, and not the fourth paragraph caveat that books in the home are likely a proxy for other facets of good parenting—like income, emphasis on education, and parental educational attainment.

Correlation—in this case, of bookshelves at home with achievement later in life—does not indicate causation. In fact, it often does not. The rooster might believe it causes the sun to rise, but reality is more complex. Good researchers—such as the original authors of the bookcase study—cop to this possibility and explain how their results might only refer to a deeper association.

No research study is perfect, and all of the studies we cited above have real merit. But, by asking good questions of any research finding, parents and journalists can help bring about sounder conclusions, in life and in policy-making. It’s easy to believe catchy, tweet-able headlines or the pithy summaries of institutional press releases. But since our kids’ education ultimately depends on the effectiveness and applicability of the available research, we should ensure that our conclusions are as trustworthy and formed as they can possibly be.

8 responses to “How to Read Education Data Without Jumping to Conclusions (Jessica Lahey & Tim Lahey)”

Alice in PA

August 11, 2014 at 7:30 am

This is fantastic! I would add that quantitative research is not the only research out there. Qualitative research offers some insight into the reasons those statistical changes happened. These studies are rarely reported, I think because they tend to be smaller and messier and therefore do not lend themselves to big headlines and generalizations. As a teacher and researcher I find a lot of useful ideas in these studies.
Also, just like in physical science, a canonical question involves the time dependent case. Do the effects hold up over time? Some studies of Head Start have evidence that the effects disappear by a few years but then others have looked into high school and found long term effects that others missed.

- larrycuban
  
  August 11, 2014 at 7:47 am
  
  Thanks, Alice, for the reminder of the critical role that qualitative studies play in educational research. Such studies, as you know, lack the zip (possible causal connection to behavior) that media needs in reporting findings that will play well with viewers and readers.
  
Daun Kauffman

August 11, 2014 at 7:13 pm

Bless you, Bless you, Bless you larrycuban !

- larrycuban
  
  August 11, 2014 at 8:31 pm
  
  Thank you, Daun, for the comment.
  
John Weisenfeld

August 12, 2014 at 8:10 am

Larry, thanks for boiling “healthy skepticism” down to a few good points. As a high school science teacher, I almost want to build a whole unit around your 4 points. I think kids can get these concepts, at which point they would be empowered.

As a math-endorsed teacher, I hearing that argument again that perhaps AP Statistics should be the pinnacle achievement of high school math in a big-data (con/in)fused world, instead of AP Calculus. The physics teacher in me would regret that, but perhaps there’s some good gains to be made from some math teacher+science teacher collaboration on lessons around these topics.

They certainly would be timely–thank you for including some recent brouhahas in your examples. I have no doubt that education won’t move forward in my state until many education stakeholders (dare I say parents? voters?) also develop some healthy big-data skepticism.

- Alice in PA
  
  August 12, 2014 at 9:44 am
  
  I completely agree with a larger emphasis on statistics. I think that we scientists deal with a lot of statistical phenomena. For example, kinetic theory of matter and radioactive decay.
  
Ann Staley

August 13, 2014 at 10:47 am

I’m not normally a reader of research papers, but I love John W’s idea of teaching students Larry’s Four Question Paradigm. I’ll happily read a narrative about these results, n = 180 or 50 or whatever.

- larrycuban
  
  August 13, 2014 at 8:10 pm
  
  Thanks, Ann, for taking the time to comment.

How to Read Education Data Without Jumping to Conclusions (Jessica Lahey & Tim Lahey)

8 responses to “How to Read Education Data Without Jumping to Conclusions (Jessica Lahey & Tim Lahey)”

Leave a comment Cancel reply

Pages

Visitors to this site

Published Posts

Email Subscription

Twitter Updates

How to Read Education Data Without Jumping to Conclusions (Jessica Lahey & Tim Lahey)

Share this:

Related

8 responses to “How to Read Education Data Without Jumping to Conclusions (Jessica Lahey & Tim Lahey)”

Leave a comment Cancel reply

Pages

Visitors to this site

Published Posts

Email Subscription

Twitter Updates