The specific-examples gap: how ESS candidates earn marks they should be keeping

IB ESS Paper 2 rewards precise named examples more than candidates realise. This article dissects the specific-examples threshold, the five evidence categories, and the three-sentence example…

IB CoursesJune 1, 202614 min read

Strong ESS candidates frequently underperform on Paper 2 not because their content knowledge is weak, but because they mishandle the command term "with reference to specific examples." The rubric does not reward vague case-study allusions or conceptual references — it rewards named, situated, precise examples that demonstrate genuine systems thinking. This article dissects exactly what the rubric requires, why the threshold separates 5s from 6s, and the five evidence categories you can deploy consistently under exam conditions. If you are preparing for IB ESS Paper 2, this is the skill that will most directly raise your evaluation band.

Why the specific-examples command trips more candidates than 'evaluate'

Most ESS candidates entering the exam room expect the hard part to be the evaluation itself — weighing competing claims, constructing a sustained argument, reaching a justified conclusion. In practice, examiners report that the more consistent failure is the inability to substantiate evaluation claims with examples that meet the rubric's standard. The word "examples" appears in dozens of Paper 2 questions, and the phrase "with reference to specific examples" or "using named examples" appears in roughly two-thirds of Section B prompts. Candidates who treat these as optional flavour text rather than core assessment criteria lose marks on every single response.

The command term "evaluate" asks candidates to make a judgement supported by evidence and reasoning. "With reference to specific examples" tells you what kind of evidence the examiner expects: not vague allusions to concepts, but named cases situated in real locations, time periods, or systems. A candidate who writes "deforestation causes biodiversity loss" is making a true claim. A candidate who writes "deforestation in the Malaysian peninsula between 2000 and 2020 released approximately 1.5 Gt of stored carbon and reduced the range of the Bornean orangutan by an estimated 50%" is meeting the specific-examples threshold. Both answers are factually correct. Only one is worth a Level 6.

What the rubric actually requires

The Paper 2 assessment criteria for evaluation questions allocate marks across four dimensions: knowledge and understanding, application and analysis, synthesis and evaluation, and the quality of examples and evidence. The final dimension carries between 2 and 3 marks depending on the question — a proportion that often determines whether a response sits at the bottom or top of a band. The criterion descriptor for a Level 6 response explicitly references "well-chosen examples" that are "named and situated" and that "effectively support the evaluation." The Level 5 descriptor uses "appropriate examples" without the precision requirement. That single word — precise — is the operational difference between the bands.

The rubric's language is precise here. "Examples" without qualification opens the door to generic case-study references. "Specific examples" demands something named and situated. When a question specifies "named examples," the threshold rises further: geographic specificity is expected. A Level 5 response might reference "deforestation in the Amazon." A Level 6 response specifies "the Brazilian Amazon, which lost approximately 1.1 million hectares of primary forest in 2019 alone according to INPE data." The difference is not just detail density — it is the demonstration that the candidate understands how to anchor systems thinking in verifiable, located evidence.

Common pitfalls: three patterns that cost marks every exam

The first and most damaging pattern is the umbrella term. Candidates write "tropical rainforest" when the rubric expects "the Amazon basin" or "the Tapajós River region within the Brazilian Amazon." The umbrella term signals that the candidate has content knowledge but lacks the geographical precision the assessment requires. Under exam conditions, umbrella terms often appear because candidates cannot recall the specific location — a preparation problem, not a knowledge problem.

The second pattern is the example footnote — treating the example as a separate addendum to the argument rather than an integral component. A response might build a sophisticated evaluation of renewable energy policy, then append "e.g., Germany's Energiewende" as if the named case study is an optional illustration. Examiners reading for integration mark this approach down because it suggests the candidate has not understood that examples are evidence within the argument, not decoration after it.

The third pattern is more subtle: the example that is named but not analysed. A candidate writes "the Great Barrier Reef has experienced significant bleaching events," which names the example but fails to connect it to the evaluation. What exactly did the bleaching demonstrate about ecosystem resilience? What quantitative evidence supports the claim? How does this specific case illuminate the general principle the question asks about? A named but unanalysed example earns very few of the available marks.

The re-reading trap

A fourth pattern worth naming: candidates who rely exclusively on examples they studied during the course and do not extend them with additional detail. This is not wrong — course examples are entirely valid — but if every candidate in the exam room references the same four case studies, the examiner sees a lack of independent depth. A strong response might take a course example and extend it: if the course covers deforestation in the Amazon, a Level 6 response adds recent data on fire frequency, specific indigenous communities affected, or a particular policy intervention and its documented outcomes. This layered use of a course example demonstrates depth beyond what the textbook provides.

The five evidence categories you can deploy in every Paper 2 exam

Effective specific examples fall into five distinct categories. Understanding these categories allows you to build a flexible toolkit during preparation and select the right type of evidence for any question context.

Geographically situated examples name an actual location with sufficient precision to be verifiable: a specific national park, a named river basin, a particular country or region. "Palm oil plantations in Sarawak" carries more weight than "palm oil production."
Temporally precise examples anchor claims to a specific period, year, or timeframe rather than a vague "recent decades." "Australia's bushfire season of 2019-2020" is more precise than "Australian bushfires."
Quantitative examples include a number that substantiates the claim: a percentage, a rate, an area, a population figure. "Coral cover on the Great Barrier Reef declined by approximately 50% between 1995 and 2017" is a quantitative example.
Named species or system component examples identify the specific organism, process, or policy by both common and scientific name where relevant. "Bornean orangutan (Pongo pygmaeus)" demonstrates precision that "orangutan" alone does not.
Policy or intervention examples name a specific programme, treaty, or regulation and its documented effects. "Costa Rica's Payments for Ecosystem Services scheme, operational since 1997, has been credited with reducing deforestation rates from 4% to under 1% annually in some regions."

You do not need to deploy all five categories in every example. Most strong responses use two or three categories per example, selected for relevance to the question. The skill lies in matching the example type to the evaluative demand of the question.

The three-sentence example integration pattern

Under exam conditions, the most reliable way to meet the specific-examples threshold consistently is to use a three-sentence integration pattern within each evaluative paragraph. This pattern ensures that every example you introduce is named, situated, and connected to the evaluative claim.

Sentence 1 — Name and situate: "The Amazon rainforest in Brazil lost approximately 1.1 million hectares of primary forest in 2019 according to INPE satellite data." This sentence satisfies the geographic and temporal precision requirements in a single statement.

Sentence 2 — Provide the relevant detail: "This deforestation reduced carbon sequestration capacity by an estimated 0.2 Gt CO₂ annually while fragmenting habitat for approximately 10% of global terrestrial species." This sentence connects the example directly to the systems concept — in this case, ecosystem services and biodiversity — that the question addresses.

Sentence 3 — Draw the evaluative implication: "This demonstrates that even large-scale protected areas can experience significant degradation when economic pressures and governance failures coincide, which supports the argument that conservation outcomes depend more on policy enforcement than on spatial designation alone." This sentence explicitly connects the example to the evaluative judgement the response is making.

Practising this pattern during preparation builds a repeatable habit. In the exam, the pattern becomes automatic, and you can vary the sentence structure to avoid sounding formulaic while maintaining the three components: name and situate, provide the relevant detail, draw the evaluative implication.

Named examples and evaluation quality: the rubric table

The table below illustrates the observable differences between bands in the examples and evidence criterion for a typical Paper 2 evaluation question. Use this as a diagnostic tool when reviewing your own responses.

Band	Example quality descriptor	Observable features in a response
Level 6-7	Well-chosen, named, and situated examples; effectively support the evaluation	Specific location and time period for each example; quantitative data included; examples integrated into the evaluative argument rather than appended
Level 5	Appropriate examples used; generally relevant	Examples are relevant to the question but lack precision; may use umbrella terms; examples present but not consistently integrated
Level 4	Examples mentioned but not fully developed or analysed	Examples are named but lack supporting detail; no quantitative data; evaluative connection to the argument is weak or absent
Level 3 and below	Minimal or no use of specific examples	Conceptual references without named cases; general statements without any situated evidence

Building your example reference sheet before the exam

The single most effective preparation activity for the specific-examples threshold is building a structured reference sheet of approximately 20 examples organised by syllabus topic. Each example on the sheet should include the following: the location, the time period, one or two quantitative data points, and the systems concept it best illustrates. During the exam, you do not copy from this sheet — you use it during revision to internalise the details so that they are available under timed conditions.

Select your examples strategically. Ideally, each example should be capable of illustrating at least two different syllabus concepts, which gives you flexibility across different question types. For instance, the Great Barrier Reef bleaching events of 2016-2017 can illustrate ecosystem resilience, biodiversity loss, climate change impacts, economic consequences of environmental degradation, and the limitations of protected area management — five different syllabus areas from a single, well-understood example. If you can name, situate, and provide data for 15-20 examples of this quality, you will have more than enough material for both Paper 2 questions.

Avoid the temptation to collect as many examples as possible. Twenty examples with precise details are worth far more than fifty examples with vague ones. The rubric does not reward the quantity of examples — it rewards the quality of their integration into evaluative argument.

Common pitfalls and how to avoid them

Beyond the three patterns discussed earlier, there are two additional traps worth naming explicitly. First, the single-example dependency: candidates who find one well-understood case study and apply it to every question regardless of relevance. The Amazon appears in deforestation questions, yes — but it is not the right evidence for a question about ocean acidification or urban resource consumption. Relevance matters. The examiner can tell when an example is being forced into a question where it does not naturally fit.

Second, the invented data trap. Under time pressure, candidates sometimes fabricate statistics to add the quantitative dimension to their examples. This is genuinely dangerous: examiners cannot verify individual figures in the exam room, but invented data is penalised severely if it is identified, and more importantly, the habit of invention erodes the precision that the rubric rewards. If you cannot recall a specific figure, use a qualitative description instead: "coral cover declined substantially between 2016 and 2017" is weaker than a number, but it is honest. Never invent a percentage or a population figure to fill a gap.

The practical fix for both traps is the same: timed practice with self-assessment against the rubric table above. When you write a practice answer, identify each example you have used and ask three questions: Is it named? Is it situated? Does it connect to the evaluative claim? If the answer to any of these is no, revise the example before moving on.

Applying the same precision to Paper 1

While the specific-examples threshold is most visible in Paper 2 Section B, it also operates in Paper 1 — particularly in the stimulus-based questions of Section A where candidates must interpret data, graphs, or diagrams and connect them to syllabus concepts. The principle is the same: data that is named, situated, and connected to a concept earns more marks than data that is merely described. A response that identifies a trend in a specific dataset, names the location and time period it refers to, and explains the systems mechanism behind the trend is demonstrating the same precision that earns high marks in Paper 2.

The difference is that Paper 1 examples come from the stimulus material rather than from your preparation. The preparation skill that transfers across both papers is the habit of asking: what is the location? What is the time period? What quantitative data is available? How does this specific case illustrate the syllabus concept? Applying this habit to unseen stimulus material during the exam is a trainable skill, and it is worth dedicating practice time to Paper 1 Section A interpretation alongside your Paper 2 preparation.

Conclusion and next steps

The specific-examples threshold is not a vocabulary test. It is a systems-thinking assessment: the examiner wants to see whether you can ground abstract evaluation in concrete, situated evidence. The rubric rewards precision not because detail is inherently impressive, but because the ability to name, locate, and quantify environmental evidence is itself a demonstration of the systems thinking that ESS as a discipline requires.

If you take one action from this article, build a reference sheet of 15-20 examples, each with a location, a time period, and one or two quantitative data points. This single preparation activity will address the specific-examples threshold across both Paper 1 and Paper 2 and will have more impact on your overall score than almost any other study activity available to you. Combine this with timed practice in the three-sentence integration pattern, and you have a reliable method for meeting the rubric's requirements in every evaluation response you write.

The command to use specific examples is asking you to demonstrate that ESS is not just a subject you have studied — it is a way of understanding environmental systems through evidence. That demonstration is what separates the responses that reach Level 6 from the ones that remain at Level 5, and it is the skill that will serve you most directly in the IB ESS exam.

ESS Paper 1 and Paper 2: why your reading strategy matters more than your content knowledge Why your ESS diagrams earn fewer marks than your written answers — and the evidence hierarchy that fixes it Why do ESS Paper 1 Section A scores lag behind Section B? The preparation gap most candidates never close

Frequently asked questions

Why does 'with reference to specific examples' appear in so many ESS Paper 2 questions?

The phrase reflects the discipline's core methodology: ESS requires evidence-based argumentation grounded in real environmental systems. The command term ensures candidates demonstrate not just conceptual understanding but the ability to locate, name, and support claims with situated evidence. This is how ESS distinguishes itself from purely theoretical subjects.

How many specific examples do I need per Paper 2 response?

Most Level 6 responses include 2-3 named, situated examples per question, each integrated directly into the evaluative argument rather than listed separately. Quality matters more than quantity: a single well-chosen example with geographic precision, temporal specificity, and quantitative data will earn more marks than three vague references.

Can I use examples from my ESS course notes and still reach Level 6?

Absolutely. Course examples are entirely valid and often the most reliable source of detailed evidence. The key is extending them beyond what the textbook states: adding recent data, specific years, or policy outcomes that demonstrate independent depth. Layering your own research onto course examples is a reliable path to Level 6.

What is the difference between 'examples' and 'named examples' in the rubric?

'Examples' without qualification may be satisfied by general case-study references. 'Named examples' requires geographic specificity — actual locations with names and ideally coordinates or regions. A response that names 'deforestation in the Brazilian Amazon' meets the named-examples threshold more fully than one that references 'tropical deforestation' in general terms.

Is it better to use a few detailed examples or many brief examples across the paper?

A few detailed examples consistently integrated into evaluative arguments outperforms many brief examples that are appended separately. The rubric rewards integration and depth. Four to six well-developed examples per paper, each with location, time period, and quantitative data, will position a response strongly across all four assessment criteria.

The specific-examples gap: how ESS candidates earn marks they should be keeping

Why the specific-examples command trips more candidates than 'evaluate'

What the rubric actually requires

Common pitfalls: three patterns that cost marks every exam

The re-reading trap

The five evidence categories you can deploy in every Paper 2 exam

The three-sentence example integration pattern

Named examples and evaluation quality: the rubric table

Building your example reference sheet before the exam

Common pitfalls and how to avoid them

Applying the same precision to Paper 1

Conclusion and next steps

Frequently asked questions

Related Posts

How to plan a 12-week ESS SL revision block around the IA and Paper 1

3 ESS SL Section A answering moves that lift a 4 into band 5

4 ESS SL case-study moves that rescue a Paper 2 Section B answer

Why the specific-examples command trips more candidates than 'evaluate'

What the rubric actually requires

Common pitfalls: three patterns that cost marks every exam

The re-reading trap

The five evidence categories you can deploy in every Paper 2 exam

The three-sentence example integration pattern

Named examples and evaluation quality: the rubric table

Building your example reference sheet before the exam

Common pitfalls and how to avoid them

Applying the same precision to Paper 1

Conclusion and next steps

Related reading

Frequently asked questions

Related Posts

How to plan a 12-week ESS SL revision block around the IA and Paper 1

3 ESS SL Section A answering moves that lift a 4 into band 5

4 ESS SL case-study moves that rescue a Paper 2 Section B answer