A free, open tool that scores any AI system out of 100 – so anyone can understand whether an AI tool is truly justified, regardless of their technical background.
Artificial intelligence (AI) is now used everywhere – in hospitals, schools, social media, policing, and creative industries. But most people have no way of knowing whether a particular AI tool is actually good for society, or whether it causes harm. Experts disagree, the science is complex, and companies rarely publish honest assessments of their own products.
This tool solves that problem by applying a scientific scoring method to 40 real AI systems. It measures five things that matter – environmental impact, ethical risk, creative displacement, purpose, and transparency – and combines them into a single Justifiability Score from 0 to 100, along with an A–F grade (just like an EU energy rating on a fridge). The higher the score, the more justified the AI system is. The lower the score, the more reasons there are to question whether it should exist.
You don't need to know anything about AI or data science to use this tool. Everything is explained in plain language.
This tab lets you explore our database of 40 real AI use cases that we have already evaluated — from life-saving medical tools to harmful surveillance systems. Select any use case from the dropdown to see its full score breakdown and understand exactly why it received that score.
You can also adjust the weights below to change how much each dimension counts. For example, if you personally care more about ethics than emissions, slide the Ethics weight up and see how the scores change.
Think of weights like marks in an exam – they decide how much each subject counts towards your final grade. The five sliders must add up to 1.00 (100%). Our defaults are based on academic research, but you can change them to reflect your own values.
Below you can see the score for each of the five dimensions we measure. Each score is out of 100 – higher is always better (meaning lower risk, lower emissions, or higher transparency). The final Justifiability Score is a weighted combination of all five.
Do you use an AI tool and want to know whether it is truly justified? Enter its details below and we will calculate a Justifiability Score for it – using exactly the same scientific method we used for our 40 pre-evaluated use cases.
You don't need to be a technical expert. Each field below is explained so you know exactly what information to provide. If you are unsure about a value, use your best estimate – the tool is designed to be accessible to everyone.
Optional: adjust how much each dimension counts towards the final score. All five sliders must add up to 1.00. If you are unsure, leave these as they are – the defaults are based on published research.
Each score is out of 100 – higher is always better. The final Justifiability Score is a weighted combination of all five, using the weights you set above. Click the button below each section to understand the science and data behind each score.
This table shows all 40 AI systems ranked from most to least justifiable using our scoring framework. Use it to compare systems side by side and see at a glance which AI tools are well justified and which are not.
Try changing the weights below to see how the rankings shift depending on what you care about most. For example, push the Ethics slider to 0.50 and watch which use cases rise and fall in the ranking – it reveals how much the result depends on your value priorities.
Click the 🏷️ Label button on any row for a full breakdown of why that AI system received its score.
Move any slider and the entire table re-ranks instantly. This demonstrates that our framework is transparent – you can see exactly how and why each AI system's rank changes based on different priorities.
| # | Use Case | Score | Band | Purpose | Label |
|---|
Each bar represents a 5-point score band (e.g. 60–65). The height of the bar shows how many of the 40 AI systems fall into that band. A good framework should produce a spread-out distribution – if all scores were bunched together in the middle, the tool would not be discriminating enough to be useful.
This tab pulls back the curtain on the science behind this tool. Every chart and number is calculated live from the real dataset. Click any card to expand it and see the full evidence. You don't need to know anything about statistics — everything is explained in plain English.
We built a 5-dimension scoring formula, stress-tested it with 300 random experiments, and validated it against human judgement. This tab shows the receipts.