Data quality

Fake event protection

Fabricated events reach analytics through the Measurement Protocol, replayed beacons, or scripted bots. Because collection endpoints accept well-formed requests by default, defense relies on validation: allow-listing hostnames, checking event shape, and flagging implausible patterns. This page describes layered protections that keep fake events out of trusted totals without claiming any single control is foolproof.

Partially verified

How fakes get in

Analytics endpoints are designed to accept any well-formed beacon so that real browsers are never blocked. That openness lets fabricated events in: a leaked Measurement Protocol secret, a replayed collection URL, or a headless bot firing the same payloads can all look syntactically valid. There is no single signal that proves an event is human, so protection is layered.

The goal is not perfect exclusion but keeping fabricated volume out of the numbers people trust.

Open endpoints accept any well-formed beacon
Sources include MP injection, replays, and bots
No single check proves an event is human

Layered protections

Allow-list the hostnames that may originate events so off-domain hits are dropped. Validate event shape against your tracking plan and reject unknown names or wrong types. Watch for plausibility violations — conversions without preceding interactions, impossible geographies, identical repeated payloads — and quarantine rather than count them. Combine server-side bot classification with these checks so each catches what others miss.

This builds on Measurement Protocol spam and api_secret handling: those describe the threat, this describes the defense.

How it appears in analytics and logs

Events from hostnames you do not own, impossible sequences, or replayed identical payloads point to fabricated rather than genuine activity.

Diagnostic use case

Reduce the share of fabricated events in analytics by validating hostname, event shape, and plausibility before counting them as real.

What WebmasterID can help detect

WebmasterID classifies events server-side and separates bot and injected hits from the human total by default.

Common mistakes

Relying on one control as a complete defense.
Counting off-domain hostnames as real traffic.
Accepting events whose shape violates the tracking plan.

Privacy and accuracy notes

Plausibility checks should use event metadata, not invasive fingerprinting. This page is educational, not legal advice.

↑ All data-quality topics in Data quality

Sources and verification notes

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.