Open sourceAnalytics

CRM Analytics Tool

An end-to-end customer-analytics system that turns millions of raw transaction rows into monthly segmentation, lifecycle, and churn signals — driven by transparent, brand-configurable rules.

PythonSQLSQLitePlotly.js

My role: Designed and built the system, modeled on CRM analytics I delivered for a Nespresso distributor.

Open simulation ↗GitHub ↗

Executive summary

CRM Analytics is a complete customer-analytics system: feed it raw customer, product, and transaction data — millions of rows — and it returns monthly, decision-ready segmentation, lifecycle, and churn signals for every customer.

It replaces the usual cycle of exporting CSVs and hand-building one-off pivot tables with a reproducible, auditable pipeline. Every number traces back to a documented rule, so the business and the code agree on exactly what "a Gold customer" or "a churn risk" means.

What it looks like

crm-analytics · monthly snapshot

1.2MCustomers

AED 84M12-mo value

18%At-risk

4RFM tiers

Champions · 22%Loyal · 28%At-risk · 18%Dormant · 32%

Illustrative mock-up of the CRM analytics dashboard — KPIs, monthly trend and RFM segments.

The analytics it produces

Each month, every customer is scored on independent dimensions built on an RFMT model (Recency, Frequency, Monetary, Tenure), where consumable purchases drive engagement and device or accessory sales are deliberately excluded from the segmentation.

Value tierFive tiers — Diamond, Platinum, Gold, Silver, Bronze — derived from average monthly consumption, order size, and six-month order frequency, with a passive state for inactive customers.
Activity statusActive vs. inactive, based on whether a customer purchased within the trailing 12 months.
Lifecycle eventsNew, Lost, and Reactivated flags, recomputed every month from cumulative purchase windows.
Churn riskAt-risk high-value customers are flagged for intervention before they lapse.
TrendsValue-tier mix and lifecycle shifts tracked across rolling 12-month periods.

Brand-configurable rules & documentation

The segmentation logic is defined as data, not buried in code. A single business-logic specification is the source of truth for every threshold and definition — the project's principle is "rules first, code second."

Because the rules are parameterized, each brand can tune its own thresholds — what counts as Diamond, what window defines churn — and the entire history recalculates deterministically. The specification is published as documentation alongside the dashboard, so anyone can audit exactly how a number was produced.

Business-logic specSegmentation rules and KPI definitions in one authoritative document.
Source & schema specsInput file formats and the normalized database schema, fully documented.
Rendered with the appSpecs are rendered to HTML and shipped beside the dashboard for easy auditing.

Backend & data pipeline

The backend is a deterministic, three-stage pipeline: raw data goes in, and pre-computed monthly snapshots plus a static dashboard come out — with no server required at runtime.

Ingest

Customers, products & transactions CSVs
Or a seeded synthetic-data generator
Loaded into normalized SQLite tables

Compute

Parameterized, set-based SQL
12-month orchestrator (build.py)
Per-customer monthly snapshots

Publish

Snapshot JSON exports
Plotly.js + Bootstrap dashboard
Specs rendered to HTML, screenshots

Built for millions of rows

Rather than querying transactions live, the system pre-aggregates them into monthly per-customer snapshots. Millions of transaction rows collapse into compact, query-ready summaries, so the dashboard stays instant no matter how large the underlying history grows.

Calculations run as plain, set-based SQL over SQLite, and the Python orchestrator replays them month by month to build a full 12-month history in one reproducible build.

The dashboard

The output is a static, mobile-friendly dashboard: a 12-month value-tier trend, lifecycle-event charts, and headline KPIs such as new customers, churn risk, and high-value accounts at risk — with filtering and CSV export, and no signup or backend to run.

CRM Analytics dashboard with KPI cards, a value-tier donut, lifecycle-event bars, and a per-tier KPI table — Live dashboard from the open-source demo — all figures are synthetic.

Tech stack

PythonSQLSQLitepandasPlotly.jsBootstrap 5Jinja2GitHub Pages

Outcome

The result is a transparent customer-analytics engine that any team can run on its own data with three commands. Segmentation, lifecycle, and churn signals that once took manual spreadsheet work are produced automatically every month, with every figure traceable to a published rule.

The whole system is open source and ships with a live, synthetic-data demo — so the logic can be inspected end to end without exposing any real customer data.