View stories by State
HOME RSS FEEDS ARCHIVES ABOUT US SITE MAP PUBLICATIONS
Search using      Advanced
Saturday, November 21, 2009
or Browse All States
CRIME & COURTS
ECONOMY & BUSINESS
EDUCATION
ELECTIONS
ENERGY
ENVIRONMENT
GOVS' SPEECHES
HEALTH CARE
HOMELAND SECURITY
POLITICS
RECESSION & RECOVERY
SOCIAL POLICY
TAXES & BUDGET
TECHNOLOGY
TRANSPORTATION
ARCHIVES
COMMENTARY
PUBLICATIONS
RSS FEEDS
STATE SPEECHES
NEWS ALERTS
PUBLIC POLICY LINKS
TOOLBARS
STATE BLOGS
ISSUE BLOGS


Register to comment on Stateline.org Stories

Thursday, January 17, 2008

Do state tests make the grade?

Comments Write the editor Print this story

It’s hard to overestimate the importance of standardized tests in public schools today. Grade advancement, high school diplomas, teacher bonuses, principals’ jobs and school reputations can all hinge on whether a student picks the right answer.

So who creates the tests that carry so much weight?


Order your copy today.
Much of the work is done by five giants: CTB/McGraw-Hill, Educational Testing Service, Harcourt Assessment, Pearson Educational Measurement and Riverside Publishing. Together, the companies own about 90 percent of the state-testing business, which has become a $1.1 billion industry since passage of the federal No Child Left Behind Act in 2001. The law, which took effect in January 2002, requires states to give annual reading and math tests to third- through eighth-graders, and to test students in those subjects once again in high school.

Working with state educators, the big five — or big four, once Pearson’s planned acquisition of Harcourt takes place — create and score the tests. But the explosion of testing and changes in the types of tests states administer have left the companies scrambling to keep up.

Also, differences in state standards that are used to create the tests and the reluctance of some states to spend money for high-quality, challenging tests have caused a great disparity in testing from state to state.

For example, a look at various fourth-grade reading tests shows wide differences. Texas’ 2006 reading test is entirely multiple choice. Ohio’s 2005 test includes several short-answer questions, such as asking for the main conflict in a passage; in another section, students fill out a cause-and-effect chart for a certain problem. Massachusetts’ 2007 test was arguably the most rigorous: Students had to answer four long open-response questions.

Some states have fewer questions that test writing skills. A main reason for that is money. Gary Cook, Wisconsin’s former testing director, said it could cost a thousand times more to score an essay question than a multiple-choice question. In 2005, the Editorial Projects in Education Research Center reported that 15 states relied entirely on multiple-choice questions in their reading and math tests. Some of these states gave separate writing tests in certain grades. (On Jan. 9, the center reported that 12 states use only multiple-choice questions on their math and reading tests.)

“People who don’t have their heads stuck in the instruction don’t realize it’s not cheap to do this really well,” Cook said of test-making. “And right now, I don’t know many legislatures that are very open to spending money or raising taxes to develop these kinds of instruments.”

Last year, the federal government gave states $407.6 million to help pay for testing. But states have said that falls short. In January, a federal appeals court revived a lawsuit that charges the federal government does not provide enough money for states and districts to meet the law’s requirements.

In the 2006-07 school year, Virginia spent in state and federal money about $11.92 per test, while Washington state spent about $17.74 per test. But these amounts also include the cost of additional tests the states administered that were not required by No Child Left Behind.

South Carolina doesn’t even calculate a “per pupil” number because not every student is tested. Despite these disparate ways of measuring spending, one thing is known: Of the total education budget combining federal, state and local money, less than one quarter of 1 percent goes to testing.

“States are not putting any more resources into the testing infrastructure, and as a result, we are getting testing on the cheap, and that is working against No Child Left Behind’s efforts to produce high-quality assessments that promote higher standards,” said Thomas Toch, the co-director of Education Sector, a nonpartisan think tank. “If we’re going to make tests the driver of quality in public education, then we need to invest to ensure that we get tests that are up to that task.”

On the whole, however, state spending on testing has shot up since George W. Bush’s education plan became the law of the land. In early 2001, a year before No Child Left Behind was enacted, states collectively spent almost $423 million on standardized tests, according to a Stateline.org report. During the 2007- 08 school year, states will spend almost $1.1 billion on these tests, according to Eduventures Inc., an education industry research firm.

The costs have been driven up by the sheer volume of testing required by the law. In 2005-06, when states had to have math and reading tests in place for all the required grades for the first time, about 45 million tests were administered throughout the country, 11.4 million more than the previous year. This year, states are required to add a science test, which is expected to add another 11 million tests to the total.

One side effect of the greater demand for tests is a shortage of educational professionals with a jawbreaking job title: psychometrician. Their task is to oversee test creation, administration and scoring. Competition for them is fierce among the test companies, and they’re often lured away from one company by another.

“A psychometrician has to make sure that algorithm is absolutely correct so that a student who is just barely going to pass, doesn’t just barely fail,” said Dr. Michael Bunch, a psychologist and senior vice president of research and development at North Carolina-based Measurement Inc., which has contracts with a dozen states.

The increased emphasis on testing has caused other pressures as well. States are now testing later in the academic year to squeeze in more teaching. And, states want scores back faster than ever before.

Not surprisingly, this has caused some high-profile errors. In 2004, CTB/McGraw-Hill had to re-score thousands of Connecticut tests after scores came in mysteriously low because of the grading on the writing section. In 2006, the late distribution of tests to Illinois by Harcourt meant a long delay in getting the scores back to the state. Last year, American Institutes for Research had to re-grade 98,000 Hawaii tests after teachers found that some students who submitted blank test books received scores anyway.

Probably the biggest impact of the No Child Left Behind law has been on the kind of tests states are giving, which has changed dramatically. The law, which imposed so much federal intrusion into local classrooms, passed only with a compromise: States would be allowed to create the tests.

The result was states switched from standardized tests that compare how their students stack up against students across the country — the Stanford Achievement Test is a prominent example — to those based on each state’s specific standards. But this system allows the difficulty of the tests to vary widely from state to state, resulting in some states producing easier tests that measure lower-level skills.

A report released in October by the conservative Thomas J. Fordham Foundation studied testing in 26 states by comparing students’ proficiency on two tests — their state tests and the Measures of Academic Progress (MAP) test, administered by the Northwest Evaluation Association (NWEA).

The results were stark: Differences in student performance from state to state were significant. According to NWEA’s analysis, to be considered proficient readers in Wisconsin, for example, fourth-graders needed to answer questions about as difficult as one that asked them to note a few differences between cats and dogs. But fourth-graders in Massachusetts faced more difficult questions such as those about a written passage by Russian author Leo Tolstoy.

Wisconsin Department of Public Instruction spokesman Patrick J. Gasper called the Fordham study flawed and pointed out that the two tests compared in the report have different goals and measure different skills.

“Their report attempts to draw conclusions by comparing two different types of tests that are scored on two different scales,” Gasper said. “The purpose and types of tests being compared are different.”

Test difficulty can be measured in a variety of ways. One is by looking at how tough the questions are. Again, a review of various fourth-grade reading tests shows that reading passages on Massachusetts’ test are lengthier and appear to be more detailed and use more difficult vocabulary than the passages on Ohio’s and Texas’ tests. One question on Ohio’s test simply asks students to identify who is speaking in the passage.

But that is an imprecise way to determine a test’s difficulty, said John Cronin, a research specialist at NWEA and one of the Fordham study’s lead authors. Several factors could make a reading question difficult, such as the length of the passage, how straightforward the question is and the quality of the wrong answers in multiple-choice questions. Test creators can determine how difficult a question is only after they see how it has performed in field tests with real students.

And even if the questions themselves are challenging, a test’s difficulty can be misleading if the state sets a low cut score, or the number of questions a student must answer correctly to be deemed proficient. The Fordham Foundation report found that states’ definition of “proficient” based on the cut score was far from consistent. Cronin said setting low cut scores would be like saying a high school baseball player is proficient only if he can hit against Major League pitcher Roger Clemens — an admittedly high performance standard — but then requiring batters to get only one hit out of 50 pitches to meet that standard.

“Instead of a .300 (batting average), they’re aiming for .020,” Cronin said. “It’s a very rigorous test, facing a very difficult pitcher, but the standard of performance is very low.”

These test disparities and the wide variety of tests has fueled calls for national standards.

“You may be deemed proficient in North Dakota math or Wisconsin math, but that doesn’t mean you’re proficient, really, in math,” said Fordham spokesman Jeffrey Kuhner. “If you want to prevent this varied discrepancy from state to state, rather than dumbing down our standards we believe that the way to do it is to have an across-the-board national test and national standards.”

International comparisons show that students in several countries have caught up to — and passed — American students. The results from the 2006 Program for International Student Assessment, which compares how 15-year-olds in 57 countries perform, showed U.S. students were 29th in science and 35th in math, behind counties like Estonia, Slovenia and Latvia.

As Congress works to authorize No Child Left Behind, several experts are promoting national standards, but the concept seems unlikely to advance in the face of staunch opposition by local-control advocates. When Fordham released an analysis showing how national standards could come about, it was titled, “To Dream the Impossible Dream.”

A bipartisan discussion draft proposing changes to NCLB released by key U.S. House members — most notably Rep. George Miller (D-Calif.), the chairman of the House Committee on Education and Labor and one of the prime movers of the original No Child Left Behind bill — would not require national standards or a national test. The draft, released in August, would give states incentives to work with universities and the business community to develop more rigorous standards and tests based on those standards.

“To be successful, our system of accountability must encourage states to set high standards,” Miller said in September at a House hearing. “Lowering the bar so more students can reach it is a sham.”


This article was excerpted from “State of the States 2008,” Stateline.org’s annual report on significant state policy developments and trends released Jan. 16. In parentheses are any news updates since the report was sent to the printer. Download a PDF version of the entire report here.

Comment on this story in the space below by registering with Stateline.org, or e-mail your feedback to our Letters to the editor section at letters@stateline.org.

Contact Pauline Vu at pvu@stateline.org.

See related stories:

Lake Wobegon, U.S.A. -- Where all the children are above average

Lawmakers balk at cost of fed education law

Special report: States pay $400 million for tests in 2001


Comment on this story in the space below by registering with Stateline.org.

Issues: Welfare & Social Policy    Politics    Education    Economy and Business    Taxes and Budget   
Topics: NCLB/No Child Left Behind    standardized tests    schools   

COMMENTS (1)
Most Recent Comments
More details needed
By K P on Oct 13, 2008 8:54:58 AM

There are three major flaws with this report.

The most significant of these is the lack of information provided by teachers and school administrators. There is none, as you will notice. This article is written to discuss the efficacy of the state tests, accuses the states of not providing high quality tests, and calls our standards sub par. These statements are made by congressmen rather than experts in education. The greatest critic is "Rep. George Miller (D-Calif.), the chairman of the House Committee on Education and Labor and one of the prime movers of the original No Child Left Behind bill." You will note that he does not have a doctorate in educational research, but claims to be the voice of what is really needed in our public schools.

This article would have been more satisfactory in this respect if someone would have actually talked to teachers and administrators. It is unjust for people who are not in the public education system with our high stakes testing environment to say that we are failing.

Better yet, why don't we just ask the students? The best to ask would be those who have thrown up before the test. Or maybe we could ask those sixth and seventh graders who have peed themselves during the test because they were so nervous that their school would not pass. If you talk to these students, you will find that they are not always the low functioning students that the school is so concerned about, but the average students who feel the weight of the tests so strongly. Maybe the best students to ask would be those who are at-risk or have disabilities. I know that they would tell you that no matter how hard they try, they just can't do as well as their peers. That's because this test isn't designed for them. It is designed for 'normal' students.

Another problem with this report is the misrepresentation of financial figures in this report. It does note how much money the big textbook companies are taking in as a result of this legislated standard for education, but not in full. It does not recognize the expense of the materials used to prepare students for the test. These include review guides specifically tailored to match state standards and practice assessments.

The final problem that I will note here relates to one of the sources cited in the report: Eduventures Inc. I went to this sites website but was unable to find any reference to the information contained in this article. While that does not mean that the information doesn't exist, it also does not point specifically to where this information may verified. As we have all learned, if a source cannot be verified, it cannot be trusted.

Please just keep in mind as you read these articles that they are written by journalists with their own opinions and beliefs. Remember that all information must be verified rather than taken at face value. It is articles such as this, with facts omitted or misrepresented, that have guided the general public to believe that we as educators are not doing our jobs.

Report as Offensive
Read More Comments
Recession and Recovery
Read the latest news, analysis and research on the economic crisis in the states in Stateline.org's new Recession and Recovery special section.
The Stimulus and the StatesThe Stimulus and the
States

Follow how states are managing the stimulus money and which programs are receiving funding as part of the recovery effort using Stateline.org's stimulus special section.
Stateline Blogs
Stateline.org has compiled an extensive list of state issue political blogs to make it convenient for you to follow state government.

If a blog you find interesting and informative is not on our list, tell us about it by sending an email to editor@stateline.org.
Blogs organized by Issue
lineBlogs organized by State
State Public Policy Resources
Stateline.org has put together a list of state public policy resources organized by issue. Here, you will find useful links to essential information from government, academia, and think tanks. If you have a link to add, please email us.


The Pew Charitable Trusts applies the power of knowledge to solve today’s most challenging problems. Pew's Center on the States identifies and advances state policy solutions.