Matomo: open-source, self-hostable analytics
Matomo (formerly Piwik) is an open-source web analytics platform that can be self-hosted on your own server or used as a hosted cloud service. Self-hosting means the visitor data lives in your own database rather than a vendor's. It offers a session/visit-based model with familiar reports plus optional add-ons, and configurable options such as cookieless tracking and IP anonymization.
What this means
Matomo records visits and actions in a relational database. The defining choice is deployment: the self-hosted edition runs on your own PHP/MySQL stack so the raw data never leaves your infrastructure, while Matomo Cloud is a vendor-hosted option.
Reports resemble traditional web analytics — visitors, visits, actions, referrers, and goals — with add-ons for heatmaps, funnels, and more.
What to weigh
Self-hosting gives data ownership and control over retention but means you operate and update the software yourself. Matomo supports configurable privacy controls, including a cookieless mode and IP anonymization, which can reduce the consent surface depending on settings.
- Self-host (own DB) or use Matomo Cloud
- Visit/action model with goals and optional add-ons
- Configurable cookieless mode and IP anonymization
Migration notes
When moving from a hosted tool, expect definitional differences: Matomo's 'visits' are not identical to another tool's 'sessions', and goal definitions must be re-created. Plan how you import or restart historical data, since self-hosted history starts when your instance does.
How it appears in analytics and logs
Matomo numbers come from your own instance when self-hosted, so discrepancies versus a hosted tool reflect where collection runs and how cookies/anonymization are configured, not vendor sampling.
Diagnostic use case
Consider Matomo when data ownership or self-hosting matters and you want familiar visit-based reports without sending raw data to a third party by default.
What WebmasterID can help detect
WebmasterID shares the first-party, data-ownership mindset and adds AI-crawler and bot intelligence; this page describes Matomo even-handedly for a migration decision.
Common mistakes
- Assuming self-hosting removes all maintenance and update work.
- Comparing Matomo 'visits' to another tool's 'sessions' as identical.
- Forgetting to configure anonymization before relying on it for compliance.
Privacy and accuracy notes
Matomo can be configured for cookieless tracking and IP anonymization; self-hosting keeps data on infrastructure you control. Configuration and region still determine your obligations. This is factual, not legal advice.
Related pages
- Plausible: lightweight, privacy-focused analytics
Plausible is an open-source, cookieless, privacy-focused analytics tool. It deliberately keeps a small script and a simple metric set (visitors, pageviews, sources, top pages) and avoids cookies and cross-site identifiers. The trade-off is intentional: less granularity and individual-level depth in exchange for simplicity and a smaller privacy surface.
- Cookieless analytics: how it works and its limits
Cookieless analytics records visits and events without setting cookies or persistent cross-site identifiers. It relies on first-party, server-side signals and aggregate counting. The trade-off is honest: it cannot follow an individual across sessions the way cookie-based tracking can — which is exactly the point for privacy-first measurement.
- Self-hosted vs cloud analytics
Choosing between self-hosted and cloud (vendor-hosted) analytics is mainly a trade-off between data ownership and operational effort. Self-hosting keeps raw data in your own database and gives you control over retention, but you run, secure, and update the software. Cloud is operated for you but the data lives with the vendor. Neither is universally better.
- Privacy-first analytics
First-party, privacy-respecting measurement.
Sources and verification notes
Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.