AP Syllabus focus:
‘Intelligence test scores have been used to restrict access to jobs, military ranks, schools, and U.S. immigration.’
Intelligence testing has shaped major U.S. institutions by guiding selection and placement decisions. This page focuses on how tests were historically used—and misused—when scores were treated as fixed, objective measures of human worth.
What “intelligence tests” were used for historically
Early intelligence tests were developed to classify performance and predict outcomes, then quickly expanded into high-stakes gatekeeping.
From educational tool to social sorting
Tests shifted from identifying students needing support to ranking people for scarce opportunities.
Scores were often treated as stable traits, rather than outcomes influenced by schooling, language, and environment.
Major historical uses and misuses (AP focus areas)
Jobs and employment access
Employers used test scores to make hiring and promotion decisions.
Use: screen applicants efficiently.
Misuse: exclude groups when tests were culturally loaded or unrelated to job performance, turning selection into systemic restriction rather than prediction.
Military ranks and placement
Large-scale testing (e.g., WWI-era group tests) influenced assignment and leadership decisions.

A scanned Army Alpha group test booklet (Form 7) used for large-scale classification of U.S. Army recruits and later reprinted for school use. The page shows timed, standardized item formats typical of mass testing, which helps explain why performance could be influenced by literacy, familiarity with test conventions, and educational opportunity—not just underlying ability. Source
Use: classify recruits for roles.
Misuse: treat quick group-administered scores as definitive “ability,” despite language barriers, limited education, anxiety, and unfamiliar test formats that could depress performance.
Schools (placement, tracking, and access)
Schools used scores for tracking, special program placement, and admissions.
Use: match instruction to student needs.
Misuse: lock students into lower tracks with fewer resources and reduced expectations, creating self-fulfilling outcomes; decisions sometimes relied on a single score rather than multiple measures and reevaluation.
U.S. immigration restriction
Intelligence tests were used to justify limiting entry to the United States.

A world map summarizing immigration quota areas under the Immigration Act of 1924, including regions effectively barred or tightly limited. Seeing quotas as geographic categories highlights how policymakers translated group-level assumptions into rigid national-origin restrictions rather than individualized assessments. Source
Use: claim to identify “undesirable” entrants.
Misuse: ignore cultural and language differences, then generalize group score differences into claims about national or ethnic “fitness,” supporting restrictive immigration policies.
The role of eugenics in test misuse
Some test interpretations fed into the eugenics movement, which argued society should “improve” heredity by controlling reproduction and membership.
Eugenics: A movement claiming human traits (including intelligence) should be improved by encouraging reproduction of “fit” people and restricting reproduction and rights of those labeled “unfit.”
Eugenic thinking encouraged policies that treated test scores as biological destiny, strengthening discriminatory practices in schooling, work, and immigration.
Why misuse was persuasive: key reasoning errors
Reification and determinism
Reification: treating an abstract construct (“intelligence”) as a concrete, single thing measured perfectly by a score.
Determinism: assuming scores reflect fixed genetic limits rather than modifiable skills and opportunities.
Validity ignored or overstated
Tests were assumed to measure “native intelligence” even when they assessed language, schooling, and cultural knowledge.
Institutions often skipped the central question: Does this test predict success in this specific context, fairly, for all groups tested?
Policy overreach
Group averages were used to justify individual exclusion.
Test results were used to support broad social policies (who may lead, learn, work, or enter a country) beyond what the measures could legitimately conclude.
Ethical and scientific takeaways for AP Psychology
Intelligence tests can inform decisions, but historical misuse shows the danger of attaching moral worth and civil rights to a single score.
High-stakes uses require careful attention to fairness, validity evidence, appropriate norms, and multiple sources of data before restricting opportunity.
FAQ
WWI-era group testing popularised large-scale, standardised administration and rapid scoring.
It also normalised using brief scores for major placement decisions, shaping later civilian testing practices.
Testing conditions were often rushed and stressful, with major language barriers.
Tasks could reflect unfamiliarity with U.S. culture and schooling rather than underlying reasoning ability.
They treated scores as evidence of hereditary “fitness,” then argued for social control policies.
This reframed a test score into a claim about biological worth and social value.
Yes; several cases and policies challenged discriminatory identification and placement practices.
These disputes pushed schools towards broader assessment practices and scrutiny of test bias.
Professional standards emphasise validity for purpose, fair comparisons, appropriate norms, and transparent interpretation.
Common safeguards include accommodations, multi-method assessment, and monitoring for adverse impact.
Practice Questions
Explain one way intelligence testing was historically misused in relation to U.S. immigration. (2 marks)
1 mark: Identifies a misuse linked to immigration (e.g., using test scores to deny entry or justify restrictive policies).
1 mark: Explains why it was a misuse (e.g., language/cultural bias, invalid inference about “fitness,” unfair generalisation).
Discuss how intelligence test scores were used to restrict access in two different settings (jobs, military ranks, schools, or U.S. immigration), and explain one key assumption/error that enabled misuse. (6 marks)
1 mark each (2 marks): Describes restriction in two distinct settings.
1 mark each (2 marks): Explains how the test score functioned as a gatekeeper in each setting.
Up to 2 marks: Explains one enabling assumption/error (e.g., reification, determinism, ignoring validity/cultural loading) clearly linked to the misuses.
