
GenAI Use Case Selection and Risk Assessment

Intended audience: Social Impact Professionals

Once you’ve made your way through steps one through four in the Playbook article “Getting Started with GenAI,” you might be ready to evaluate and select your first GenAI use case. This article is intended for anyone at a social impact organization engaged in the strategic design and decision making around GenAI use cases and risk mitigation, and it is written to be accessible and useful to both technical and nontechnical audiences.

To get started on brainstorming, a few use cases that are generating the most enthusiasm across the social sector include:

  • Personalized service delivery: A particularly common use case for educational/training content is identifying the right modules or material to serve to an end user next, based on their experience. The same pattern applies across many types of social impact organizations, for tasks like supporting local campaigners with localized information drawn from data analysis, job matching, and interview prep. Although this is often the most exciting use case, it is also the riskiest. When social impact organizations evaluate GenAI for personalized service delivery, they frequently conclude that the risks, and the guardrails needed to manage them, outweigh the benefits for now.
  • Data science: GenAI can supplement and speed up technical work, although end users still need data and coding skills to fact-check outputs and review code (see the sketch after this list). With that in mind, GenAI can assist with:
    • Data cleaning, database wrangling, quality checks, etc.
    • Data analysis, qualitative text analysis, summarizing research, etc.
    • Metadata prediction, labeling data, codebook creation, etc.
  • Knowledge management: GenAI is fantastic at navigating many resources to find and/or summarize the most relevant information. Staff and users are often overwhelmed trying to find the resources they need, and GenAI can speed up that process.
  • Content creation: GenAI can be used to create curricula, images, communication materials, presentations, job descriptions, meeting summaries, etc. Generating new content is the classic task the technology was designed for and where it thrives.
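
If you want a feel for what the data science use cases above look like in practice, here is a minimal sketch of LLM-assisted data labeling using the OpenAI Python SDK. The model name, category list, and prompt wording are illustrative assumptions rather than recommendations; any chat-capable model and any taxonomy could be substituted, and a human should still review the labels before the data is used downstream.

```python
# Minimal sketch: using an LLM to label unlabeled survey responses.
# Assumes the OpenAI Python SDK (`pip install openai`) and an API key in the
# OPENAI_API_KEY environment variable; the model name and categories below
# are illustrative placeholders.
from openai import OpenAI

client = OpenAI()

CATEGORIES = ["housing", "employment", "health", "education", "other"]

def label_response(text: str) -> str:
    """Ask the model to assign exactly one category to a free-text response."""
    prompt = (
        "Classify the following survey response into exactly one of these "
        f"categories: {', '.join(CATEGORIES)}.\n"
        "Reply with the category name only.\n\n"
        f"Response: {text}"
    )
    completion = client.chat.completions.create(
        model="gpt-4o-mini",  # assumption: any chat-capable model works here
        messages=[{"role": "user", "content": prompt}],
        temperature=0,  # deterministic output suits labeling tasks
    )
    label = completion.choices[0].message.content.strip().lower()
    # Guardrail: fall back to "other" rather than trusting a malformed reply.
    return label if label in CATEGORIES else "other"

# A human should still spot-check a sample of labels for quality.
print(label_response("I can't find an affordable apartment near my job."))
```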

Now, you might be ready to dive into evaluating what makes the most sense. Here is a long list of possible social sector use cases, organized by the type of generative AI task and the typical social impact organization department. Bring your team together to read through the list, collaboratively flagging any tasks that you see as potentially “high impact” and “low risk”:

Type | Department | Task
Generate | Programs | Draft program updates and delivery plans
Generate | Fundraising | Create fundraising marketing documents
Generate | Grants | Draft donor reports
Generate | Communications | Create communication materials (docs, images, videos, social media content, etc.)
Generate | Fundraising | Draft proposal notes
Generate | Legal | Draft contracts
Generate | Operations | Take notes (e.g., for video calls)
Generate | Technology | Write computer code
Generate | Human resources | Draft job descriptions
Generate | Programs | Draft strategic plans for program implementation based on needs
Generate | All | Provide insights and advice in specific crisis situations
Generate | All | Personalized learning and guidelines
Predict | Programs | Climate events, economic crashes or inflation, forced migration, conflict, etc.
Automate | Data | Codebooks & metadata documents
Automate | Data | Labeling unlabeled data
Automate | Data | Database wrangling, cleaning, and quality checks
Automate | Data | Generating data taxonomies
Analyze | Evaluation | Project evaluation
Analyze | All | Sentiment analysis
Analyze | Human resources | Impact & performance
Detect | Programs | Natural disasters and anomalies
Detect | Technology | Fraud
Detect | Technology | Mis- and disinformation
Review | Technology | Review computer code to find bugs or mistakes
Review | All | Find errors or trends within documents
Review | Human resources | Review candidates’ fit with job descriptions
Summarize | Research | Rapid research synthesis
Summarize | Research | Summarize research findings and reports (analyze patterns in large sets of reports and identify trends)
Summarize | Evaluation | Summarize impact reports
Summarize | Fundraising | Summarize available funding and resources
Summarize | Operations | Summarize meeting notes
Summarize | Operations | Summarize videos or audio
Manage | Operations | Organize notes
Manage | Evaluation | Identify and organize themes within qualitative data
Manage | All | Classify information
Generate | Communications | Design presentations, storyboards, etc.
Generate | Communications | Edit images & videos
Analyze | Data | Image tagging
Generate | Data | Layout designs for visual data
Generate | Operations | Create templates
Generate | Communications | Create talking points
Generate | Communications | Format documents
Generate | Operations | Create checklists or task lists
Analyze | Grants | Match SDGs to outcomes
Analyze | Operations | Respond to audits
Analyze | Grants | Automatic report tagging
Analyze | Operations | Semantic search, searchable knowledge
Manage | Programs | Personalize the service delivery experience with chatbots

Evaluate these opportunities, and any others you can think of, to identify and prioritize relevant use cases, define a vision, and find focus. Visit the projects section of the DataKind Playbook and dive into the “Design Stage” for resources on opportunity identification, the project design process, and narrowing your use case. Follow design thinking principles to co-create with impacted communities, discuss the most urgent problems, and identify where GenAI might create the most value.

Risk Evaluation and Mitigation

Next, consider the challenges, constraints, and guardrails for risk mitigation for any possible use case. For comprehensive risk management, consider all four steps of a standard risk assessment approach: (1) risk identification, (2) risk analysis, (3) risk evaluation, and (4) risk treatment. For the purposes of this Playbook article, we focus only on the high-level risk evaluation and mitigation strategies needed to inform selecting a use case and scoping a GenAI project. Holistic risk management comes later, after a use case has been identified with risks in mind.

There are many risks to be aware of when starting to think about using GenAI. We’ve broken them down into five broad categories.

  1. Hallucination - or inaccuracy - one of the most crucial blockers for many use cases is that GenAI can produce plausible-looking responses that are factually incorrect.
  2. Bias - a broad category that encompasses many limitations and areas of concern, including the fact that models surface the bias and prejudice found in their internet training data. GenAI tools can also be inaccessible or non-inclusive across languages and other human differences, increasing inequity.
  3. Transparency - commercial models are usually closed access. You cannot know what data a model was trained on or how it works, which makes it hard to audit outputs or anticipate failure modes.
  4. Misuse - GenAI can be misused in many ways. For example, data leakage means that private data someone shares with an LLM may be used or exposed elsewhere. Deepfakes, in which someone modifies real content to deceive, are increasingly common and harmful. And even when guardrails are in place, bad actors can attempt prompt injection to bypass those guardrails and exploit the model for nefarious purposes, such as producing hate speech.
  5. Development - the substantial computing power these models require can have a large environmental impact; there is a risk of exploiting the workers needed to test and train the models; and there is a large risk of intellectual property infringement, amounting to plagiarism, when a model is trained on words and images created by humans without their explicit consent or usage rights.

In addition to the policy development step mentioned in the article “Getting Started with GenAI,” one of the first mitigation strategies is to scope and select a problem with the risks top of mind. It is essential to start with a very narrow, low-risk use case: a constrained scope maximizes safety and minimizes the risk of harm.

On top of careful scoping to select a safer use case, and general policies to mitigate risk across all GenAI use, it’s important to define specific criteria and solutions so that safety, accountability, and equity are built into the use case you select. The table below provides sample solutions for mitigating the risks of a use case. Note that it is nowhere near comprehensive; rather, it is meant to give you a solid starting point for thinking about risk mitigation in this space.

Sample area(s) of risk | Sample mitigation approach(es)
Hallucination, information loss, errors | Tools for controlling LLM outputs: Guardrails, Outlines, LMQL, and SGLang. Inference strategies that generate better outputs through calls to models and tools: chain-of-thought, self-consistency, WikiChat, and grounding techniques like retrieval-augmented generation (RAG). Less technical elements: limit data sources and always include references to support required human-in-the-loop fact-checking reviews (see the sketch after this table).
Data leakage | Use data security systems and select tools that protect privacy.
Misuse, jailbreak, model exploitation, prompt injection (e.g., a worm that targets GenAI applications such as RAG systems) | Set up model safety guardrails and build in accountability with observability, monitoring, red teaming, etc. to ensure model compliance and appropriate responses to harmful requests. With open models, use the same care as when installing any other software or library.
Bias | Test for equity and inclusivity for historically disadvantaged communities.
Transparency | Select a constrained use case so it stays clear how the system is working.
Inaccessibility (language, internet access, etc.) | Include translation checks, create accommodations to increase accessibility, etc.
Carbon footprint/climate change | Conduct a cost-benefit analysis to select use cases and approaches that minimize the computing power used.
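
To make the “limit data sources and always include references” row concrete, here is a minimal sketch of that grounding pattern: answers are drawn only from a fixed document set, and the sources are returned alongside the answer so a human reviewer can fact-check every claim. The keyword-overlap retriever, sample documents, and `ask_llm` stub are illustrative stand-ins; a real system would use embedding-based search and an actual model call.

```python
# Minimal sketch of grounded question answering with references for
# human-in-the-loop review. Documents and retrieval are toy placeholders.
documents = {
    "annual_report_2023.pdf": "We served 4,200 households across three counties.",
    "program_manual.docx": "Eligibility requires county residency and income verification.",
}

def retrieve(question: str, k: int = 2) -> list[tuple[str, str]]:
    """Rank documents by naive keyword overlap with the question."""
    q_words = set(question.lower().split())
    scored = sorted(
        documents.items(),
        key=lambda item: len(q_words & set(item[1].lower().split())),
        reverse=True,
    )
    return scored[:k]

def ask_llm(prompt: str) -> str:
    # Stand-in for a real model call (e.g., the labeling sketch earlier).
    return "(model answer would appear here, citing [source] names)"

def answer_with_references(question: str) -> dict:
    """Answer only from retrieved context and return sources for review."""
    sources = retrieve(question)
    context = "\n".join(f"[{name}] {text}" for name, text in sources)
    prompt = (
        "Answer using ONLY the context below. If the answer is not in the "
        "context, say you don't know. Cite the bracketed source names.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    return {
        "answer": ask_llm(prompt),
        "references": [name for name, _ in sources],  # for the human reviewer
    }

print(answer_with_references("How many households were served in 2023?"))
```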

It’s important to note that although GenAI is very easy for anyone to use, there is a learning curve to ensuring safety by design. Using these tools responsibly requires an investment of time to develop staff skills, implement these solutions, and continuously test and monitor.

Pulling it all together, here’s an example of what this AI risk evaluation looks like in The Wildlife Trusts’ effort led by Alice Kershaw. They have generously shared their working draft, built with DataKind volunteer Daniel Nissani after participating in a DataKind Learning Circle, so you can see a real-life example of how this might play out in your social impact organization. Find more examples in the GenAI Resource Hub.

Making a decision

Focus on identifying and building low-risk solutions while mitigating any remaining risks. For example, asking an LLM questions it must answer from its training data alone is high risk, with a large possibility that it will return false information; using an LLM to predict technical metadata on data you provide is a lower-risk solution. While low-risk solutions are still not risk-free and require mitigation strategies, they are much better places to start. Assess the risks and possibilities across use cases using a simple matrix:

[Figure: A chart with two axes labeled “risk” (x-axis) and “impact” (y-axis), with the “low risk, high impact” quadrant highlighted as the area to look for use cases.]

When selecting your GenAI problem statement, use this matrix to evaluate risk (the x-axis) versus impact (the y-axis). Think about where any possible GenAI use case might fall. You are looking for a use case in the top-left quadrant, with the lowest level of risk and the highest possible impact. Think through possible use cases, assess the risks of each opportunity, and chart them on the matrix to identify the best place to start.
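
One lightweight way to use the matrix with your team is to score each candidate use case on risk and impact and sort accordingly. The sketch below shows the idea; the candidate names, 1-to-5 scales, and quadrant thresholds are placeholder assumptions your team would set together in discussion.

```python
# Minimal sketch: score candidate use cases on risk and impact (1 = low,
# 5 = high) and surface the low-risk, high-impact quadrant first.
# The example scores are placeholders a team would assign collaboratively.
candidates = {
    "Summarize meeting notes":      {"risk": 1, "impact": 2},
    "Label unlabeled survey data":  {"risk": 2, "impact": 3},
    "Personalized service chatbot": {"risk": 5, "impact": 5},
    "Draft donor report outlines":  {"risk": 2, "impact": 3},
}

def quadrant(scores: dict) -> str:
    """Map a (risk, impact) score pair to a quadrant of the matrix."""
    low_risk = scores["risk"] <= 2      # threshold is an assumption
    high_impact = scores["impact"] >= 3  # threshold is an assumption
    if low_risk and high_impact:
        return "low risk / high impact (start here)"
    if low_risk:
        return "low risk / low impact"
    if high_impact:
        return "high risk / high impact (needs strong guardrails)"
    return "high risk / low impact (avoid)"

# Sort so low-risk, high-impact candidates come first.
for name, scores in sorted(
    candidates.items(), key=lambda kv: (kv[1]["risk"], -kv[1]["impact"])
):
    print(f"{name}: {quadrant(scores)}")
```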

For example, see the table below for examples of risk and guardrails specific to a few program evaluation and impact assessment use cases:

Use case/opportunity | Area(s) of risk | Mitigation approach(es)
Summarize research findings and reports (analyze patterns in large sets of reports and identify trends) | Summaries reflecting demographic biases and inaccuracies from low-quality inputs. | Implement bias testing. Follow ethical data practices. Use quality filters for input sources. Account for language/cultural nuances.
Sentiment analysis (clustering responses as positive, negative, or neutral depending on the text, using natural language processing) | Biases in training data leading to inaccurate sentiment analysis across demographics. | Periodically validate sentiment analysis results against human judgments and adjust models accordingly (see the sketch after this table).
Conduct analysis for project evaluation | Hallucination, security, and overreliance; potential biases in evaluation processes; decision-making rationale lacking transparency. | Regularly validate project evaluations. Include the reasons behind decisions for transparency. Analyze data and “score” it using different weights to determine confidence levels.
Identify and organize themes within qualitative data | Omission of key perspectives across diverse contexts in identified themes; data security. | Validate AI-identified themes against human judgment. Provide clear security controls for any sensitive qualitative data.
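
As one example of turning a mitigation approach into a routine check, the sentiment analysis row above recommends periodically validating results against human judgments. Here is a minimal sketch of that comparison; the example labels and the 0.8 agreement threshold are placeholders your team would calibrate.

```python
# Minimal sketch of the sentiment-validation guardrail from the table above:
# compare model labels against a small human-labeled sample and flag drift.
# The labels and threshold below are illustrative placeholders.
human_labels = ["positive", "negative", "neutral", "negative", "positive"]
model_labels = ["positive", "negative", "positive", "negative", "positive"]

agreement = sum(h == m for h, m in zip(human_labels, model_labels)) / len(human_labels)
print(f"Human-model agreement: {agreement:.0%}")

if agreement < 0.8:  # threshold set by your tolerance for error
    print("Agreement below threshold: review prompts/model before relying on results.")
```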

Think about how harmful the risk is, how beneficial the use case is, and how feasible the identified mitigation approach is. Creating a table like the one above to unpack the use cases you are considering will help you prioritize and select a starting use case.

Please see other “scoping” articles for social impact organizations, like those listed at the bottom of the page here: https://datakind.github.io/scoping.html

Contributor(s): John Harnisher, Matt Harris, Lindsey Washburn, Kathleen Yaworsky

Contact us

If you would like to learn more about us, partner with us, or get in touch, email us at community@datakind.org
