AI Marking for Teachers: Top Marks AI Achieves 1.02 Mark MAE on Edexcel GCSE History: 12 Mark Explain Question

Study reveals Top Marks AI achieving 1.02 Mark MAE for Edexcel GCSE History: 12 Mark Explain Question, November 24, 2025

AI Marking for Teachers Achieves 1.02 Mark Average Error for Edexcel's GCSE History: 12 Mark Explain Question

"How accurate are your AI GCSE History marking tools?" We encounter this question regularly when speaking with teachers and educational institutions.

For questions with a limited marking range like this 12-mark question, we focus on a particularly important metric:Mean Absolute Error (MAE). MAE tells us, on average, how many marks our AI differs from the exam board's marks. A low MAE means high accuracy.

As such, we've been systematically testing to prove the accuracy of the Top Marks' GCSE History AI marking tools really are. We think you'll find the results compelling!

In this experiment, we will be examining Edexcel History -- specifically, the GCSE History: 12 Mark Explain Question.

Edexcel makes available numerous exemplar essays for their exam papers and we've put our tool to the test using 64 of those very same exam board approved standardisation materials. These exemplars showcase a broad spectrum of answer quality. These are official standardisation materials that show teachers the spectrum of answer quality.

We took 64 of these essays and ran them through our dedicated marking tool. Then we measured the difference between the official marks the board awarded each essay, and the marks Top Marks AI assigned to those same essays.

For context, how do humans perform?

What level of accuracy do experienced human markers achieve when marking essays already marked by a lead examiner?

Cambridge Assessment conducted a rigorous study to measure precisely this. 200 GCSE English scripts - which had already been marked by a chief examiner - were sent to a team of experienced human markers. These experienced markers were not told what the chief examiner had given these scripts. Nor were they shown any annotations.

The Mean Absolute Error (average difference) between the experienced markers and the chief examiner was 5.64 marks on a 40-mark question -- that's an average difference of 14.1%. You you can find the study here.

How did Top Marks AI perform?

Our system demonstrated a correlation of, our system achieved a Mean Absolute Error of 1.02 marks. On average, the AI differed from the board by just 1.02 marks on this 12-mark question. As a percentage, that's an average of 8.5% difference -- significantly better than the 14.1% human marker difference in the Cambridge study.

Moreover, 81.25% of the marks we gave were within 1.5 mark of the grade given by the chief examiner.

As an additional measure of accuracy, we also calculated the Pearson correlation coefficient, which was 0.88. This indicates a strong positive relationship between our marks and the exam board's marks, showing that when the board assigns higher marks, Top Marks AI does too, and vice versa.

We don't claim that Top Marks is infallible, but when it does get things wrong, just how bad is it? Well, let's turn to the Root Mean Square Error to find out. Root Mean Square Error (RMSE) is a measure of the severity of large errors. When you square the number 1, you still get 1, and when you square 2, you still only make a small jump to 4. But square 5, and you're suddenly all the way up at 25. That's how RMSE works - it (essentially!) highlights large errors by squaring them.

Top Marks AI's Root Mean Square Error was 1.42, meaning even when larger errors occur, they remain remarkably small relative to the 12-mark scale.

You can see the full side-by-side human and AI scores below.

Essay ID	Board Score	Top Marks AI Score	Difference
summer June 2024 4 -12 Marks 1 (-) (6).pdf	6.0	7.1	+1.1
Summer 2022 Q1b (Henry VIII) -12 Marks 1 (-) (5).pdf	5.0	5.0	+0.0
Summer 2022 Q1b (Elizabethan England) -12 Marks 1 (-) (5).pdf	5.0	6.3	+1.3
Summer 2022 4 -12 Marks 1 (-) (6).pdf	6.0	6.5	+0.5
June 2024 4 -12 Marks 1 (-) (8).pdf	8.0	6.3	-1.7
Summer 2022 Q1b (Richard & John) -12 Marks 2 (-) (11).pdf	11.0	10.8	-0.2
June _ Summer 2024 4 -12 Marks 1 (-) (3).pdf	3.0	2.0	-1.0
Summer 2022 Q1b (Anglo-Saxons & Normans) -12 Marks 1 (-) (5).pdf	5.0	4.8	-0.2
June Summer 2024 4 -12 Marks 1 (-) (12).pdf	12.0	12.0	+0.0
Exemplars 5b -12 Marks 1 (-) (6).pdf	6.0	4.7	-1.3
Exemplars 4 -12 Marks 1 (-) (5).pdf	5.0	9.0	+4.0
Summer 2022 Q1b (Elizabethan England) -12 Marks 2 (-) (11).pdf	11.0	10.0	-1.0
Summer 2022 4 -12 Marks 2 (-) (10).pdf	10.0	10.7	+0.7
Summer 2022 Q1b (Henry VIII) -12 Marks 2 (-) (11).pdf	11.0	10.0	-1.0
June 2024 4 -12 Marks 2 (-) (11).pdf	11.0	10.7	-0.3
June _ Summer 2024 4 -12 Marks 2 (-) (12).pdf	12.0	12.0	+0.0
Summer 2022 Q1b (Anglo-Saxons & Normans) -12 Marks 2 (-) (11).pdf	11.0	12.0	+1.0
June Summer 2024 4 -12 Marks 2 (-) (8).pdf	8.0	5.3	-2.7
Summer 2022 Q1b (Richard & John) -12 Marks 1 (-) (5).pdf	5.0	3.4	-1.6
Exemplars 4 -12 Marks 2 (-) (7).pdf	7.0	7.0	+0.0
Exemplars 5b -12 Marks 3 (-) (8).pdf	8.0	6.9	-1.1
Exemplars 4 -12 Marks 3 (-) (9).pdf	9.0	8.8	-0.2
Exemplars 5b -12 Marks 4 (-) (11).pdf	11.0	12.0	+1.0
summer June 2024 4 -12 Marks 2 (-) (12).pdf	12.0	10.0	-2.0
2019 Paper 1 crime 12 Marks 2 (-) (8).pdf	8.0	8.7	+0.7
2019 Paper 1 crime 12 Marks 1 (-) (11).pdf	11.0	10.0	-1.0
Exemplars 5b -12 Marks 2 (-) (8).pdf	8.0	7.3	-0.7
2019 Paper 1 warfare 12 Marks 1 (-) (11).pdf	11.0	10.0	-1.0
2019 Paper 1 warfare 12 Marks 2 (-) (5).pdf	5.0	8.0	+3.0
2019 Paper 2 Anglo Saxon 12 Marks 2 (-) (12).pdf	12.0	8.3	-3.7
2019 Paper 2 King Richard I and King John 12 Marks 1 (-) (12).pdf	12.0	10.0	-2.0
2019 Paper 2 King Richard I and King John 12 Marks 2 (-) (7).pdf	7.0	9.4	+2.4
2019 Paper 2 Henry VIII 12 Marks 1 (-) (6).pdf	6.0	9.5	+3.5
2019 Paper 2 Anglo Saxon 12 Marks 1 (-) (4).pdf	4.0	2.8	-1.2
2019 Paper 2 Elizabethan 12 Marks 1 (-) (6).pdf	6.0	6.6	+0.6
2019 Paper 2 Henry VIII 12 Marks 2 (-) (9).pdf	9.0	9.7	+0.7
2019 Paper 2 Elizabethan 12 Marks 2 (-) (9).pdf	9.0	8.5	-0.5
2019 Paper 2 Henry VIII 12 Marks 3 (-) (12).pdf	12.0	12.0	+0.0
2019 Paper 2 Elizabethan 12 Marks 3 (-) (12).pdf	12.0	12.0	+0.0
Migrants in Britain 4 -12 Marks 1 (-) (12).pdf	12.0	12.0	+0.0
Migrants in Britain 4 -12 Marks 2 (-) (5).pdf	5.0	5.0	+0.0
Migrants in Britain 4 -12 Marks 3 (-) (5).pdf	5.0	5.7	+0.7
Migrants in Britain 4 -12 Marks 4 (-) (12).pdf	12.0	12.0	+0.0
Examiner's Report - B4 Paper 2 June 2024 Q1b -12 Marks 1 (-) (12).pdf	12.0	11.6	-0.4
Examiner's Report - B4 Paper 2 June 2024 Q1b -12 Marks 2 (-) (9).pdf	9.0	9.3	+0.3
Examiner's Report - B4 Paper 2 June 2024 Q1b -12 Marks 3 (-) (11).pdf	11.0	12.0	+1.0
Examiner's Report - B2 Paper 2 June 2024 Q1b -12 Marks 1 (-) (2).pdf	2.0	2.0	+0.0
Examiner's Report - B1 Paper 2 June 2024 Q1b -12 Marks 2 (-) (5).pdf	5.0	3.6	-1.4
Examiner's Report - B1 Paper 2 June 2024 Q1b -12 Marks 2 (-) (5).pdf	5.0	3.6	-1.4
Examiner's Report - B2 Paper 2 June 2024 Q1b -12 Marks 2 (-) (9).pdf	9.0	9.3	+0.3
Examiner's Report - B1 Paper 2 June 2024 Q1b -12 Marks 1 (-) (8).pdf	8.0	6.0	-2.0
Examiner's Report - B1 Paper 2 June 2024 Q1b -12 Marks 1 (-) (8).pdf	8.0	6.0	-2.0
Examiner's Report - P2 Paper 1 June 2024 2 -12 Marks 1 (-) (11).pdf	11.0	11.8	+0.8
Examiner's Report - P2 Paper 1 June 2024 2 -12 Marks 1 (-) (5).pdf	5.0	4.8	-0.2
Examiner's Report - P2 Paper 1 June 2024 2 -12 Marks 2 (-) (7).pdf	7.0	7.0	+0.0
Examiner's Report - P2 Paper 1 June 2024 2 -12 Marks 1 (-) (12).pdf	12.0	12.0	+0.0
Germany Examiner's Report - P2 Paper 1 June 2024 2 -12 Marks 1 (-) (11).pdf	11.0	11.7	+0.7
Germany Examiner's Report - P2 Paper 1 June 2024 2 -12 Marks 2 (-) (5).pdf	5.0	5.8	+0.8
Examiner's Report - 1HI0 30 Paper 1 June 2024 2 -12 Marks 2 (-) (12).pdf	12.0	12.0	+0.0
Examiner's Report - 1HI0 30 Paper 1 June 2024 2 -12 Marks 1 (-) (7).pdf	7.0	4.4	-2.6
2019 Paper 1 medicine 12 Marks 1 (-) (11).pdf	11.0	12.0	+1.0
2019 Paper 1 medicine 12 Marks 2 (-) (8).pdf	8.0	11.3	+3.3
Medicine June 2022 12 Marks 2 (-) (12).pdf	12.0	11.5	-0.5
Medicine June 2022 12 Marks 1 (-) (8).pdf	8.0	8.7	+0.7

Can I see a graph to help me visualise this?

Absolutely.

First, here's a scatter graph to show you what a theoretical perfect correlation of 1 would look like:

Now, let's look at the real-life graph, drawn from the data above:

Actual Correlation Graph for Edexcel GCSE History: 12 Mark Explain Question

On the horizontal axis, we have the grade given by the exam board. On the vertical, the grade given by Top Marks AI. The individual dots are the essays -- their position tells us both the mark given by the exam board and by Top Marks AI. You can see how closely it resembles the theoretical graph depicting perfect correlation.

Discover how Top Marks AI can revolutionise assessment in education. Contact us at hello@topmarks.ai.

We use cookies for analytics and marketing to improve your experience — these are only set if you accept. Decline and we'll only use cookies that are strictly necessary. (Live chat is always available either way.) Learn more in our Cookie Policy.