Please understand this: the Mississippi “miracle” does not show up in performance on the test that serves as the screening mechanism that determines if students move on to take the NAEP, where the supposedly miracle is seen. So if there's no effect pre-screen and a large effect post-screen, what can we conclude? That the observed effect is the product of the screen itself. It's just more manufactured selection bias. It always is.