AI Disclosure Statement

Effective date: May 2026

What PageCrawl.io does

PageCrawl.io is a website change monitoring service. Change detection itself is deterministic: we compare page snapshots and flag the differences. No AI is involved in the detection.

On top of that, there is an AI feature that writes plain-language summaries of detected changes, so you don't have to read raw diffs. It is enabled by default and can be turned off in account settings at any time.

How the AI feature works

AI summaries are available on all plans. There are two ways to use them.

PageCrawl-managed AI. Included on all plans. We provide AI summaries through our own managed infrastructure, using enterprise AI APIs. No setup on your side. We act as the data controller and handle the agreements with the providers.

Bring Your Own Key (BYOK). Available on all plans. You can plug in your own API key from a supported provider (OpenAI, Anthropic, Google Gemini, OpenRouter). In that case, AI processing happens under your own agreement and billing with that provider.

In both cases, when a change is detected we send the before and after text to the AI provider, and the returned summary shows up next to the raw diff. You can disable the feature at any time.

Data and training

No training on customer data by PageCrawl.io. We do not collect, aggregate, or use customer data for any AI training, ever.

No training by the AI providers we use. We only use providers under enterprise API agreements that contractually prohibit training on the data we send them. Your content is processed to return a summary and is not used to improve any provider's models.

No cross-customer data sharing. Each AI request only uses the requesting customer's data. One customer's data never produces output for another.

BYOK users. Whether your AI provider retains or uses submitted data depends on your own agreement with them. Major providers (OpenAI, Anthropic, Google) exclude API traffic from training by default under their standard API terms.

Retention. AI providers may retain prompts for a short period (typically 30 to 55 days) for abuse monitoring and safety. That is the only retention; no customer data is kept beyond those operational windows.

Data ownership

All data collected through PageCrawl.io belongs to the customer. We do not sell, license, or share customer data with third parties. There are no pre-packaged datasets or starter packs derived from your monitoring. When you delete data or close your account, it is permanently removed from our systems.

Human oversight (GDPR Art. 22)

AI summaries are informational. They do not trigger actions, make decisions, or replace the underlying data. The raw change data (visual diff, text diff, snapshots) is always there for you to review. AI summaries can be disabled at any time.

The feature is not automated decision-making under GDPR Article 22. No decisions with legal or significant effects on individuals are made by the system.

Bias and discrimination

The AI feature summarises text changes on web pages. It does not process personal data, profile individuals, or make assessments about people. Bias and discriminatory outcomes do not apply to this use case.

Accuracy

AI summaries are a convenience layer. The authoritative record of what changed is always the deterministic diff. Summaries can occasionally be imprecise or incomplete; that is a known property of current LLMs. You can always check any summary against the raw data.

API key security

BYOK keys. Customer API keys are encrypted at rest, sent over HTTPS only, never stored in plaintext, and never logged. Keys are decrypted in memory only for the duration of the API call.

Managed AI. Our own API credentials are stored securely and are not accessible to customers. All traffic to AI providers goes over HTTPS.

Contact

Questions about this document or our use of AI: hey@pagecrawl.io