External providers and data processing

Our AI features rely on several enterprise sub-processors, all accessed through secure APIs with enterprise-level data handling. The provider used for each feature is configurable per customer and can include Microsoft Azure OpenAI (GPT-5.4 and other GPT models), AWS Bedrock running Anthropic Claude and Cohere embeddings, and Google Vertex AI (Gemini). Audio and video transcription additionally uses AssemblyAI. In all cases, data is processed in the region specified by the customer.

Key data handling practices:

Purpose-limited use: Data is processed only to provide and support the AI services
No consumer-product access: each service runs inside the cloud provider’s enterprise environment (Microsoft Azure, AWS Bedrock, or Google Vertex AI) and does not interact with public consumer products such as ChatGPT or the public OpenAI API
Regional hosting: Data is processed in the region specified by the customer (see the list of regions further down this article)
Access control: the cloud provider only accesses data for abuse monitoring purposes

What data is sent to these subprocessors and does it include PII?

By default, only textual user contributions (e.g., survey answers or ideas) are sent to the AI subprocessors. These are submitted to support AI analysis features like summarization and tagging

When is data sent?

Surveys: Text is sent when an admin visits the survey results page.
Ideation: Text is sent when an admin actively starts an AI analysis.

What kind of data is included?

Only the free-text content users wrote in their contributions
No structural user information is shared (e.g., email, username, profile picture, demographic data)

⚠️ If a user includes personal information (PII) in their own contribution text, that PII may be sent to sub-processors as part of the message content. This is not filtered out automatically.

Are these subprocessors using the data to train and improve their models?

No. All of our AI sub-processors explicitly state that they do not use the data to train or improve their models.

Where is Microsoft processing the data?

Microsoft allows us to specify the region of processing. We currently make use of 7 regions, where our customers use the region most local to them. The regions are:

Frankfurt
UK
US
Canada
Brazil
Paris
Stockholm

Why are the answers of AI not in my language?

Our AI feature tries to answer as much as possible in the language of the input it’s receiving. Exceptionally, in cases where there are mixed languages, there are very few inputs, or when the AI gets it wrong, it might generate answers in the wrong language. In such cases, it mostly suffices to retry.

⚠️ All core languages are supported with the exception of Greenlandic

How accurate are the generated summaries?

Summarization always means discarding some details while retaining what seems most important. AI models are strong at identifying common elements, but deciding what is most relevant requires context, domain knowledge, and subjective judgment. Because of this, summaries can be very useful but are not 100% accurate. Human oversight remains essential.

To ensure correct conclusions, our approach emphasizes responsible AI use:

The human reviewer always remains in control.
Transparency is maximized so you can verify how summaries are created.
AI provides efficiency, while humans ensure quality and accuracy.

Our platform includes several features to help you assess and improve summary quality:

Expected accuracy indication: Before and after generating a summary, the system shows an accuracy estimate (percentage)
In-line references: Each summary links back to the original resident inputs it is based on
Full data access: All project inputs remain browseable, so you can always compare summaries with raw contributions.
Tagging: Manually or automatically segment inputs into smaller groups for more focused, accurate summaries
Auto-tagging options: Multiple methods are available; tags can always be overridden for maximum control
Source-available software: The code is available on GitHub, ensuring transparency into how the tool works

We recommend using AI-generated summaries as a starting point for understanding large datasets, but not as the final word. Always check the results!

Hallucinations: While rare, the AI might occasionally generate information that was not explicitly present in the original dataset.
Exaggeration: The AI might emphasize certain themes or ideas more than others, potentially skewing the overall interpretation.
Data Volume and Accuracy: Our system is optimized for handling 20-200 well-defined inputs for the most accurate results. As the volume of data increases beyond this range, the summary may become more high-level and generalized. This does not mean the AI becomes "less accurate", but rather that it will focus on broader trends and patterns. For more nuanced insights, we recommend using the (auto)-tagging feature to segment larger datasets into smaller, more manageable subsets.
Biases: Results may contain various biases. These can arise, for example, from unbalanced data in the model’s training, differences in language and expression, and a lack of contextual understanding. In particular, irony, sarcasm, dialect, youth slang or context-sensitive phrasing may be misinterpreted.

Our platform enables you to explore the core themes, summarize the data, and examine various perspectives. If you are looking for specific answers or insights, consider using the "Ask a Question" feature to dive deeper beyond the summary.

Summary: while AI-generated summaries are highly effective, they are never 100% accurate. Our design philosophy is human-centric: the AI assists with efficiency, while you retain full transparency and control over interpretation.

FAQ on our AI Sensemaking tool