AI Output Risk Scoring Tool

Evaluate risks and determine appropriate levels of trust, scrutiny, and oversight

What Is This Tool?

This interactive tool helps you evaluate the risks associated with AI outputs and determine appropriate levels of trust, scrutiny, and oversight. By answering a series of questions about your AI system, you'll receive a risk score and guidance on how to proceed.

The assessment is divided into four key risk categories:

  • Data & Model Design Risks - Issues related to bias, proxies, quality, drift, and transparency
  • Prediction & Output Risks - Concerns about accuracy, confidence, statistical thinking, and real-world value
  • Ethical & Interpretability Concerns - Challenges with explainability, hallucination, transparency, contestability, and trust
  • Human-Centered Impact Risks - Considerations for consequences, oversight, harm, appeal, and feedback loops
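As a concrete illustration, a category-based scoring scheme of this shape can be sketched in Python. The per-question scores, thresholds, and level names below are hypothetical (the tool's actual formula is not shown on this page); the structure of scoring each answer, averaging within a category, then averaging across categories, is just one plausible approach.

```python
# Hypothetical sketch of a category-based risk score; NOT the tool's actual formula.
# Each answer is scored 0 (low risk), 1 (moderate), or 2 (high risk).

CATEGORIES = [
    "Data & Model Design",
    "Prediction & Output",
    "Ethical & Interpretability",
    "Human-Centered Impact",
]

def category_score(answers):
    """Average of per-question scores within one category."""
    return sum(answers) / len(answers)

def overall_risk(responses):
    """responses maps category name -> list of per-question scores (0/1/2).
    Returns (overall score, risk level). Thresholds are illustrative."""
    scores = {cat: category_score(responses[cat]) for cat in CATEGORIES}
    overall = sum(scores.values()) / len(scores)
    if overall < 0.7:
        level = "Low"
    elif overall < 1.4:
        level = "Medium"
    else:
        level = "High"
    return overall, level
```

A usage example: answering every question at maximum risk yields `overall_risk({c: [2, 2, 2] for c in CATEGORIES})`, which lands in the "High" band.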

Understanding AI Output Risks

Data & Model Design
Prediction & Output
Ethical & Interpretability
Human-Centered Impact

Data & Model Design Risks

These risks relate to how the AI model was built, what data it was trained on, and potential biases that might be encoded in its design.

  • Proxies for sensitive attributes - AI may use variables like zip code or school history as stand-ins (proxies) for race, income, or gender — often unintentionally reinforcing bias.
  • Historical bias in training data - Past decisions (e.g., hiring or discipline outcomes) may reflect biased practices that the model learns to replicate.
  • Incomplete context - AI often lacks the nuance of human judgment and situational awareness. Data might not include recent changes, human intent, or localized factors.
  • Data drift or concept drift - The model is trained on historical data, but real-world patterns have changed — leading to reduced accuracy over time.
  • Over-reliance on correlation - The model surfaces patterns without causal understanding, which can be misleading in high-stakes decisions.
  • Poor sampling or representation - Certain groups or contexts are underrepresented in training data, leading to biased outputs or blind spots.
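Data drift, one of the risks above, can be monitored concretely. A common check is the Population Stability Index (PSI), which compares the distribution of a feature at training time with its distribution in production. A minimal pure-Python sketch (the bin frequencies are assumed to have been computed elsewhere):

```python
import math

def psi(expected_freqs, actual_freqs, eps=1e-6):
    """Population Stability Index between two binned frequency distributions.
    expected_freqs: bin counts from the training data.
    actual_freqs:   bin counts from recent production data (same bins).
    Common rule of thumb: < 0.1 stable, 0.1-0.25 moderate drift, > 0.25 major drift."""
    total_e = sum(expected_freqs)
    total_a = sum(actual_freqs)
    value = 0.0
    for e, a in zip(expected_freqs, actual_freqs):
        pe = max(e / total_e, eps)  # clamp to avoid log(0)
        pa = max(a / total_a, eps)
        value += (pa - pe) * math.log(pa / pe)
    return value
```

For example, a feature that was split 90/10 across two bins at training time but is 50/50 in production gives a PSI well above the 0.25 "major drift" threshold.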

Prediction & Output Risks

These risks concern the accuracy, reliability, and interpretation of the AI's outputs and predictions.

  • Probabilistic vs. deterministic thinking - AI outputs are often presented with confidence, but they represent probabilities — not certainties. Users may over-trust high scores or numbers.
  • Overfitting or narrow logic - The model might perform well in training but struggle with real-world complexity, especially when new conditions arise.
  • Missing uncertainty indicators - AI rarely shows you what it doesn't know — lack of confidence intervals, assumptions, or alternative outcomes is a major risk.
  • "Statistical but not practical" findings - A result can be statistically strong yet offer little real-world value; look for actionable insight, not just mathematical confidence.
  • Illusion of precision - Outputs presented with decimal-level accuracy give a false sense of certainty.
  • Inconsistent performance across groups - The model works well for some subpopulations but not others — and the difference isn't visible unless specifically tested.
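Inconsistent performance across groups is only visible if you measure it. A minimal sketch of a subgroup accuracy check (the record format and group labels here are hypothetical):

```python
from collections import defaultdict

def accuracy_by_group(records):
    """records: iterable of (group, predicted, actual) tuples.
    Returns {group: accuracy} for each group present in the data."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for group, predicted, actual in records:
        totals[group] += 1
        hits[group] += int(predicted == actual)
    return {g: hits[g] / totals[g] for g in totals}

def performance_gap(records):
    """Largest accuracy difference between any two groups; a large gap
    means the aggregate accuracy number is hiding subgroup failures."""
    accuracies = accuracy_by_group(records)
    return max(accuracies.values()) - min(accuracies.values())
```

A single headline accuracy can mask a large `performance_gap`; testing each subgroup separately is what surfaces it.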

Ethical & Interpretability Concerns

These concerns relate to understanding how the AI makes decisions, whether its logic can be explained, and the ethical implications of its use.

  • Hallucinations (in GenAI) - AI can invent facts, sources, or reasoning — especially in generative models — and do so fluently, which makes it easy to believe.
  • Opaque logic ("black box" models) - Users may not understand how an answer was generated — making it difficult to verify or explain the result.
  • Explainability gaps - Even when models offer outputs, the reasons or factors behind those predictions may not be accessible or intelligible to end users.
  • No accountability chain - It's unclear who owns or oversees decisions made with AI, and there is no escalation or appeal process.
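Explainability gaps are smallest for simple models. For a linear scoring model, per-feature contributions (weight times value) provide one basic, legible form of explanation of the kind that black-box models lack. A minimal sketch with made-up weights and feature values:

```python
# Hypothetical linear credit-style score: contribution = weight * feature value.
# Listing contributions by magnitude is a simple, inspectable explanation.

def explain(weights, features):
    """Return (feature, contribution) pairs, largest magnitude first."""
    contributions = {name: weights[name] * value for name, value in features.items()}
    return sorted(contributions.items(), key=lambda kv: abs(kv[1]), reverse=True)

# Illustrative, invented weights and inputs:
weights = {"income": 0.5, "tenure": 0.2, "late_payments": -0.8}
features = {"income": 2.0, "tenure": 1.0, "late_payments": 3.0}
```

Here `explain(weights, features)` shows that `late_payments` dominates the score, which is exactly the kind of answer an affected person needs in order to contest a decision.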

Human-Centered Impact Risks

These risks focus on how AI decisions affect people, particularly in high-stakes contexts, and whether appropriate human oversight is in place.

  • Unintended consequences - A small prediction error in a high-stakes context (discipline, credit, hiring) can have major downstream effects on people.
  • Fairness and equity tradeoffs - AI may optimize for accuracy or efficiency at the cost of inclusion, justice, or proportionality.
  • Automation without oversight - Decisions made too quickly or too fully by AI — without human review — can lead to unethical or irreversible outcomes.
  • Feedback loop effects - Prior AI decisions influence future data, reinforcing patterns even if they're flawed (e.g., predictive policing, content recommendations).
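The feedback loop effect can be demonstrated with a toy simulation: a system that allocates checks to wherever it has found the most past hits will lock onto an early imbalance even when the true underlying rates are identical. This is a deliberately simplified, hypothetical model, not a claim about any real system:

```python
import random

def simulate(rounds=200, seed=0):
    """Toy feedback loop. Two regions have IDENTICAL true hit rates, but
    finding hits requires checking, and checks always go to the region with
    more recorded hits — so a one-hit head start monopolizes all attention."""
    random.seed(seed)
    true_rate = {"A": 0.3, "B": 0.3}   # identical underlying rates
    hits = {"A": 1, "B": 0}            # tiny initial imbalance
    checks = {"A": 0, "B": 0}
    for _ in range(rounds):
        region = "A" if hits["A"] >= hits["B"] else "B"
        checks[region] += 1
        if random.random() < true_rate[region]:
            hits[region] += 1
    return checks
```

Because region B is never checked, it never records a hit, so the allocation rule never revisits it: the data the system sees confirms its own prior behavior.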

AI Output Risk Assessment

The tool offers two versions: a Quick Assessment and a Detailed Assessment. Answer the questions about your AI system to assess its risk level and receive your Risk Assessment Results.
