Table of Contents
- Key Highlights
- Introduction
- How an Exam Defines the Range of Movement: autonomy under compression
- Polarized competence: when the test rewards some and alienates others
- From teammates to rivals: the social reconfiguration of relatedness
- Extrinsic dominance: exercise reframed as a score-getting tool
- Teachers squeezed between pedagogy and performance
- What works in the classroom: practical autonomy-supportive strategies
- Policy alternatives: de-linking long-term health from single high-stakes measures
- Equity implications: who wins and who loses under a rigid exam model
- Long-term consequences: from short-lived gains to habits that fade
- Teachers as change agents—and the supports they need
- Research gaps and next steps
- How meaningful reform might look in practice
- Voices from the field: lived evidence of tension and care
- FAQ
Key Highlights
- The Physical Education Entrance Examination (PEEE) has improved short-term fitness for some students but systematically compresses autonomy, polarizes competence, and weakens relatedness, producing mainly extrinsic, score-driven motivation.
- Teachers face persistent conflict between educational ideals and score-based accountability; policy reform toward diversified, process-oriented assessment and autonomy-supportive pedagogy is needed to protect long-term exercise participation.
Introduction
China's decision to include physical education in the high school entrance scoreline has changed what happens in gymnasiums and tracks across the country. Where PE once served multiple purposes—skill learning, social play, health promotion, recreation—it now occupies a dual role as a component of high-stakes selection. That shift reverberates through class organization, teaching practice, family investment and, most importantly, students' reasons for exercising.
A qualitative study of 24 junior-high students and eight PE teachers from four diverse schools used Self-Determination Theory (SDT) to map how the PEEE affects the three psychological needs that sustain motivation: autonomy (choice and volition), competence (feeling effective), and relatedness (social connection). The results show a complex picture: measurable fitness gains for some, but a pervasive re-framing of PE as a score-extraction exercise that undermines internalized, long-term participation. High-stakes assessment has produced short-term legibility and results at the cost of transforming the educational and social logic of PE.
This article synthesizes those findings, situates them within broader comparative practice, and outlines concrete classroom and policy alternatives that could reconcile fitness goals with the psychological foundations needed for lifelong physical activity.
How an Exam Defines the Range of Movement: autonomy under compression
One of the clearest patterns is a narrowing of choice. Students and teachers repeatedly described PE lessons that are organized around exam items—running drills, sit-ups, rope-skipping—marginalizing sport units students enjoy (dance, basketball, badminton, swimming). A student in Shanghai summarized that dynamic: “We basically just practice running, rope skipping, sit-ups—everything revolves around the exam. Every time I go to PE, I don't feel like I'm in a class; I feel like I'm in exam prep.”
Autonomy in SDT refers to the experience of acting with a sense of choice and endorsement. When curriculum decisions are determined by standardized test items and time-to-exam countdowns, the opportunity for students to select activities that align with their interests vanishes. The consequence is behavioral compliance rather than self-endorsed action: students do what they are required to do to avoid penalties or to secure points, not because they value the activity for itself.
Teachers themselves narrated the institutional forces that close down options. One teacher admitted that even if they preferred a diverse curriculum, school targets and parental scrutiny push instruction back to test events as soon as the exam draws near: “If the scores are not good, they come to question the teacher, so we basically don't dare to teach anything else.” The effect is structural rather than idiosyncratic—a routine that replicates across schools regardless of local intentions.
Real-world comparison sharpens the contrast. Many Australian and British schools emphasize formative assessment and student choice in PE, allowing selections from dance, swimming, or team sports as valid assessment pathways. That flexibility aligns assessment with autonomy support and has been associated with stronger intrinsic motivation and leisure-time activity outside school. In China, by contrast, the standardized item list restricts student movement both literally and motivationally: students who prefer swimming, dance or team sports pivot away from those interests because they do not “count.”
Compressing autonomy does more than remove variety; it interrupts the internalization process SDT describes. When students never experience ownership of their physical activities, motivations remain external. After the exam, when extrinsic pressures fall away, so do the habits and intentions for sustained activity.
Polarized competence: when the test rewards some and alienates others
Competence—the feeling of effectiveness—showed a dual pattern. For a subset of students, structured training produced clear gains and genuine pride: a novice who cut their 1,000m time from over five minutes to 4:10 reported a tangible sense of progress and strengthened confidence. For those students, the exam served as a scaffold for achievement, with targeted feedback and an achievable progression of skills.
A larger number of students experienced the opposite. Standardized, performance-bound items can create "unbridgeable gaps" for students whose physical attributes, prior experience, or developmental timeline do not match test requirements. Repeated failure produced demoralization. One student said simply, “For things like pull-ups, even if you train for a year you might not manage one more rep. I get zero every time on the test, so I just gave up.”
Two mechanisms explain this polarization. First, events like endurance runs or pull-ups favor specific physiological profiles and prior exposure. If improvement is not plausible through the available timeline and resources, effort does not yield progress—breaking the basic linkage between effort and outcome that fosters competence. Second, instruction frequently becomes performance-contingent and one-size-fits-all. When teachers must produce class-level scores, they standardize drills and intensify repetition, leaving less scope for differentiation or scaffolding that could help weaker students find incremental success.
Where competence is strengthened through realistic, personalized progression, students may shift from extrinsic to identified regulation—valuing exercise for its benefits. Where competence is eroded, motivation slides toward withdrawal or resentment: the activity is associated with humiliation and pain rather than mastery.
Classroom practice matters. Several teachers who deliberately paced learning, offered pacing strategies, or provided individualized feedback rebuilt students' competence. One student described how a teacher walked and practiced pacing, then introduced rhythm and breathing techniques: “That feeling of being cared about gave me the courage to keep trying.” Those interactions demonstrate that competence can be cultivated even in an exam context—provided instruction focuses on mastery and incremental gains rather than only on outcomes.
From teammates to rivals: the social reconfiguration of relatedness
Physical education traditionally contributes to social development—cooperative play, team coordination, and mutual encouragement. Under PEEE pressure, those relational functions were reinterpreted through the logic of score maximization. Observed shifts included: fewer group games, more individualized drills; peer comparison supplanting peer encouragement; and teachers acting more as exam supervisors than co-participants.
Students described peers as benchmarks rather than partners. One observed: “Now everyone is practicing their own event—you do your skipping, I do my sit-ups—there's basically no cooperation.” Another reported that helping a classmate is sometimes viewed as wasted time because peers fear losing their own training minutes. Teachers corroborated this, noting that competition over points can lead to exclusion or isolation of weaker students.
Teacher–student dynamics also shifted toward surveillance. With greater administrative demand for scores—recording, monitoring, and selecting high-performers for extra coaching—teachers spent more time checking standards than building rapport. Observations showed teachers patrolling, timing and correcting, with limited opportunities for dialogue or co-exploration of activities. That departure from supportive interaction undermines relatedness, which SDT identifies as essential for internalizing values and sustaining behavior.
Relatedness matters for two reasons. First, social support buffers the stress of performance pressure and creates a safe environment for risk-taking and improvement. Second, relational bonds make activities personally meaningful—students exercise not only to pass tests but to share experiences with peers and trusted adults. When PE becomes a solitary grind for points, the social incentives for ongoing participation evaporate.
One counterexample stood out: teachers who “ran with” students or offered empathic gestures preserved elements of belonging. Even small acts—walking beside a struggling runner, acknowledging effort publicly—reinstated trust and made training feel less punitive. That suggests social interventions can mitigate the relational damage produced by high-stakes formats.
Extrinsic dominance: exercise reframed as a score-getting tool
Across interviews, motivation language clustered around points, ranking and admission outcomes. Students described PE as “the easiest subject to boost your score” or as an obligation to be optimized. Many planned to stop training immediately after the exam, with one student saying, “After I finished the PE exam, I didn't want to run at all for a week—like I'd just woken up from a nightmare.”
SDT distinguishes between types of extrinsic motivation: controlled forms (external or introjected regulation) versus more autonomous forms (identified and integrated regulation). The PEEE environment encourages controlled regulation: students train to avoid penalties or to secure a marginal advantage in admission. That orientation produces compliance but not commitment. Where extrinsic motives dominate, behavior is fragile—once institutional incentives fall away, behavior often does too, sometimes replaced by active avoidance.
Parents and the broader school system amplify extrinsic motives. Parents invest in private coaching, not out of concern for a child's enjoyment but to ensure points. Schools use PE scores as performance indicators. Teachers feel evaluation pressure. This network of external controls transforms exercise into an instrument for educational mobility rather than an avenue for health or pleasure.
The “crowding-out effect” explains the psychological mechanism: when external rewards or pressures dominate, they displace intrinsic interest. Even students who might have enjoyed certain activities find their enjoyment subsumed by instrumental goals. The paradox is that a policy aimed at promoting adolescent fitness produces, at scale, a devaluing of the activity’s intrinsic benefits.
Teachers squeezed between pedagogy and performance
PE teachers occupy a difficult middle ground. They hold professional values—diversifying activities, nurturing lifelong habits, and prioritizing student well-being—but are also accountable to administrators and parents for PE scores. Interviewed teachers reported a shift from “teaching” to “training”: constructing drill schedules, recording daily scores, triaging students by test readiness.
This role strain has practical consequences. Teachers reported moral distress when asked to prioritize score gains over student enjoyment or safety. They also faced increased workload—designing extra coaching sessions, liaising with anxious parents, and managing injuries arising from intensified training.
Some teachers attempted to reconcile these pressures by embedding autonomy-supportive practices within exam-prep regimes: offering meaningful rationales for drills, setting individualized targets, or dedicating a portion of class to student choice. These tactics require extra time and emotional labor, and teachers described them as “precious” amid systemic constraints.
Teacher training and evaluation frameworks currently intensify the bind. If school-level performance assessments reward higher PE averages, teachers have strong incentives to optimize for the test. Reimagining teacher accountability to include process measures—student engagement, improvements over time, and diversity of activities—would alter teacher behavior in ways more consistent with SDT.
What works in the classroom: practical autonomy-supportive strategies
The study points to several evidence-informed teaching practices that preserve the exam's fitness goals while supporting psychological needs. Practical, small-scale changes can be implemented within current structures and produce measurable differences in student experience.
-
Offer structured choice. Even within exam-centered content, allow students to select training modes, pacing strategies, or paired partners. For example, during endurance training, students might choose interval formats, tempo runs, or paced long runs aligned to individual capacity. Choice enhances autonomy without abandoning assessment requirements.
-
Use individualized progress portfolios. Replace exclusive focus on absolute thresholds with records of personal improvement—distance covered, time reduced, reps added. Portfolios reframe competence as personal mastery rather than comparative ranking.
-
Scaffold skill progression. Break demanding events into incremental units and teach pacing, breathing and technique explicitly. Coaches who taught rhythm for rope-skipping or pacing for endurance runs helped weaker students feel progress.
-
Foster collaborative tasks. Integrate cooperative elements—peer coaching rotations, small-group drills, or shared warm-up responsibilities—that allow students to help each other without compromising individual training time.
-
Provide autonomy-affirming rationales. Explain why particular drills matter for health and long-term fitness, not just for test scores. When the purpose of training connects to students' personal goals (e.g., stamina for a favorite sport), internalization becomes more likely.
-
Preserve social rituals. Encourage group celebrations of effort, visible recognition for incremental progress, and teacher participation in training to reinforce belonging.
These practices require time, teacher skill and administrative trust. But pilot implementations in schools elsewhere have demonstrated that they can support both short-term performance and longer-term engagement.
Policy alternatives: de-linking long-term health from single high-stakes measures
At a policy level, the study recommends moving from a one-size-fits-all, summative high-stakes model toward diversified, process-oriented assessment systems. Several alternatives deserve consideration:
-
Portfolio-based assessment. Students compile evidence of participation, progress and skill development across multiple activities. Portfolios can include teacher observations, self-assessments, and objective measures. This model values growth and diversity and reduces the pressure to perfect a narrow set of events.
-
Multiple pathways for assessment. Allow students to select assessment instruments aligned to interests (e.g., team sport competency, dance or aquatic skills) so that different motor skill profiles are recognized. Some Australian and UK models allow such flexibility and report higher student satisfaction.
-
Emphasize formative feedback. Regular, low-stakes assessment with constructive feedback can promote competence by tying effort to visible improvement rather than a binary pass/fail threshold.
-
Weighting for process and participation. Instead of heavily weighting a single test battery, include measures of class participation, extracurricular involvement and demonstrated effort over time.
-
Equity-sensitive accommodations. Recognize that developmental trajectories and access to facilities differ widely. Offer alternative assessment formats for students with physical limitations or limited access to private coaching.
-
Professional development for teachers. Policies should invest in training PE teachers in autonomy-supportive pedagogy, differentiation strategies and formative assessment design.
These policy directions preserve the state’s objective—raising adolescent fitness—while protecting the psychological conditions that undergird long-term physical activity.
Equity implications: who wins and who loses under a rigid exam model
The PEEE's standardized design advantages students who can access focused training, have prior sports experience, or possess physiological traits favoring endurance or upper-body strength. Those with limited access to facilities, financial means for private coaching, or different developmental timing are more likely to struggle.
Regional disparities matter. The study sampled schools across a developed eastern city (Shanghai), a moderately developed eastern small city (Ningbo), a northeastern city (Harbin), and a less-developed western small city (Yibin). While the PEEE was implemented uniformly, local capacities—school resources, availability of coaching and parental investment—shaped different experiences. Equity-sensitive reform would need to combine assessment changes with resource allocation: providing targeted support to under-resourced schools, expanding school-based extra-curricular options, and guarding against parental private-market capture.
Beyond material equity, fairness requires recognizing varied motility profiles and learning pathways. A system that measures only a fixed battery of motor outcomes privileges some bodies and penalizes others. Alternative assessments should broaden the definition of valued physical competence.
Long-term consequences: from short-lived gains to habits that fade
A recurring theme was the temporality of fitness gains. Many improvements were tied directly to intensive, short-term training regimes. Students expressed relief and disengagement after the exam, suggesting that the PEEE yields episodic fitness increases rather than durable habit change. Where motivation remained extrinsic, behavior tended to revert or even swing towards aversion.
Sustained physical activity requires internalization—valuing exercise for its own sake or for personally meaningful outcomes. SDT posits that autonomy, competence and relatedness create that internalization. The current exam regime tends to satisfy competence partially (for some), but at the expense of autonomy and relatedness, blocking the deeper processes that produce lifelong participation.
Counterfactual examples show different trajectories. Programs that integrate choice, social support and mastery-oriented feedback—such as school sports clubs, co-curricular teams with inclusive selection policies, or after-school activity tracks—produce more durable engagement. Policy should seek to retain the fitness impetus of the PEEE while reconfiguring assessment and pedagogy to support internalization pathways.
Teachers as change agents—and the supports they need
Teachers who attempt to balance exam demands with pedagogical ideals perform crucial work. They are the ones who can transform repetitive drills into meaningful practice, who can scaffold weaker students, and who can maintain relational warmth. But they need structural support: realistic workload expectations, professional development, time to plan differentiated instruction, and evaluation systems that reward process outcomes.
Practical supports include:
- Training in autonomy-supportive techniques (offering choice, eliciting student perspectives, providing meaningful rationales).
- Assessment toolkits for portfolio-based evaluation and personalized progress metrics.
- Administrative policies that recognize teacher efforts to develop holistic outcomes, not only test averages.
- Resource allocation for equipment, safer warm-up protocols to reduce injury risk from intensified training, and opportunities for cross-school exchanges to share best practice.
Policy and school leaders who trust teachers to balance performance and well-being will find the most sustainable path to improved adolescent fitness.
Research gaps and next steps
The study's qualitative approach illuminates lived experience but leaves unanswered questions about scale and trajectory. Future research directions include:
- Large-scale quantitative surveys to measure autonomy, competence and relatedness across regions, school types and demographic groups, allowing assessment of generalizability.
- Longitudinal tracking of students across the exam period into high school to document whether exam-era training predicts sustained activity, declines or active avoidance.
- Multi-actor analyses that incorporate school administrators, local education authorities and parents to reveal governance logics and constraining incentives.
- Experimental or quasi-experimental evaluations of alternative assessment models (portfolio pilots, process weighting, multiple pathways) to test effects on short-term performance and long-term participation.
Answering these questions will clarify whether and how assessment can be designed to deliver both fairness in admissions and the psychological conditions that promote lifelong activity.
How meaningful reform might look in practice
Reform does not require dismantling the stated goal of improving adolescent fitness. It requires recalibrating how improvement is measured and what is rewarded. A pragmatic reform package might include:
- Retaining a standardized element to maintain comparability, but reducing its weight and complementing it with a portfolio and participation metrics.
- Allowing students to choose assessment pathways aligned with their interests and capacities—endurance, skills, team sports, or dance—subject to standardized competency descriptors.
- Piloting portfolio-based evaluation in a representative set of districts, with rigorous comparison to current models on both performance and psychological outcomes.
- Setting clear guidelines for teacher accountability that include process measures (student improvement, engagement) alongside results.
- Investing in teacher training and school resources to ensure equitable access to preparation.
When assessment recognizes diversity in athletic expression and values growth over ranking, instruction will shift toward mastery, and motivation will become more durable.
Voices from the field: lived evidence of tension and care
Two kinds of voices recurred in interviews: those bearing pain and those offering care. Students spoke of headaches, dread at the sound of a running lineup, and plans to stop exercising after the exam. Teachers spoke of exhaustion, parental pressure, and the moral discomfort of converting PE into a training regimen. Amid those tensions, moments of warmth mattered. Students recounted teachers who walked with them, who offered gentle praise, who celebrated incremental improvement. These relational acts were small but consequential: they preserved dignity and sometimes reignited effort.
Highlighting those moments matters because it shows that system design and human agency interact. Structural incentives shape behavior, but teacher choices within those structures change students' experience. Reform should therefore focus both on institutional levers and on empowering teachers to practice in ways that build autonomy, competence and relatedness.
FAQ
Q: What is the Physical Education Entrance Examination (PEEE)? A: The PEEE is a standardized physical fitness assessment included in the high school entrance score in many Chinese jurisdictions. It typically assesses items like sprints, endurance runs (800m/1,000m), standing long jump, sit-ups, pull-ups, and rope-jumping, and contributes a percent of the final admission score.
Q: Does the PEEE improve adolescent fitness? A: The evidence suggests it can produce short-term improvements in specific fitness measures for some students, typically those who benefit from targeted training. However, these gains often reflect intensive, short-term preparation rather than durable changes in exercise habits for the broader student population.
Q: How does the PEEE affect students' motivation to exercise? A: The PEEE tends to shift motivation toward external regulation—students exercise to gain points or avoid penalties. This pattern compresses autonomy (choice), polarizes competence (some succeed, many feel failure), and weakens relatedness (less cooperation, more comparison). Together, these effects make long-term exercise participation less likely.
Q: Are some students harmed more than others? A: Yes. Students without prior experience, limited access to training, developmental differences, or lower socioeconomic resources are more likely to experience repeated failure and demoralization. The standardized battery privileges certain physical profiles and advantages resources, raising equity concerns.
Q: What can teachers do within the current system to support students? A: Teachers can implement autonomy-supportive strategies: offer structured choices within training, set individualized, attainable goals, use portfolios or logs to show personal progress, scaffold technical skills, and maintain social rituals that foster belonging. Small acts of empathy—running with a student, praising effort—have notable impact.
Q: What policy changes are recommended? A: Recommended changes include diversifying assessment methods (portfolios, multiple pathways), shifting weight to process and participation, emphasizing formative feedback, accommodating different physical capacities, and investing in teacher development and equitable resource distribution.
Q: Would removing the exam entirely solve the problem? A: Removing high-stakes assessment could reduce instrumental pressure, but it would not automatically produce widespread participation. Without alternative systems that value participation, showcase progress, and support teachers, other pressures may fill the void. The goal is to align assessment and pedagogical design with the psychological needs that produce sustained behavior.
Q: How generalizable are the study findings? A: The study used purposive qualitative sampling across four schools in different regions, producing in-depth insights. However, its sample size and geographic coverage limit national generalizability. Large-scale quantitative and longitudinal research is needed to confirm and extend these patterns.
Q: What immediate steps could school leaders take? A: Schools can begin by piloting process-oriented measures (class participation, improvement logs), training teachers in autonomy-supportive methods, scheduling periodic “interest weeks” where students explore non-exam activities, and communicating with parents about the broader goals of PE.
Q: What would success look like? A: Success would combine measurable improvements in adolescent fitness with higher levels of student-reported autonomy, competence and relatedness in PE, sustained leisure-time physical activity post-exam, and equitable outcomes across socio-economic contexts.
The PEEE demonstrates that assessment design shapes not only the content of schooling but also the emotional and motivational architecture of young lives. When physical education is reduced to a score, its potential to cultivate healthful habits and social belonging is compromised. Reimagining assessment and classroom practice to protect autonomy, promote real mastery, and rebuild cooperative social climates offers a path from short-lived gains toward lifelong engagement in physical activity.