Dataform Cash-Forecast (BigQuery + Dataform + Looker/Omni)
Oct 16, 2025

Project Type :Subcontracting
Project Timeline : May 2025 – December 2025
Context & Objectives
The client is a Fortune-1000 enterprise with complex financial operations across multiple business units and regions, managing Accounts Receivable, Payable, Payroll, and Collections data from various ERP systems. Cash forecasting was fragmented and heavily spreadsheet-driven, leading to inconsistent results, manual reconciliations, and limited visibility into actual liquidity.
To address these gaps, the project aimed to build a governed, auditable cash-forecasting engine in BigQuery/Dataform, replacing manual spreadsheets with version-controlled SQL models and automated backtesting. The solution delivers executive-grade dashboards and finance workbooks through Looker/Omni, providing drill-downs into key drivers like payment terms, delinquency, and payroll timing. The platform ensures end-to-end governance, auditability, and SOX-compliant security, enabling a single, reliable source of truth for enterprise cash management.
Project Goals
The project aimed to modernize and streamline the client’s cash forecasting process — moving away from scattered spreadsheets to a unified, governed, and automated analytics framework. By combining BigQuery, Dataform, and Looker/Omni, the solution was designed to give finance teams a single, trusted view of cash flow while improving accuracy, compliance, and decision-making.
Build a Unified and Reliable Forecast Engine
Consolidate AR, AP, Payroll, and Bank data from multiple ERPs into a centralized forecasting model in BigQuery. This created a single, auditable view of cash positions across business units and reduced the dependency on manual spreadsheets.Enhance Accuracy, Governance, and Automation
Introduce version-controlled Dataform models, automated transformations, and backtesting logic to ensure consistent, reproducible, and SOX-compliant forecasting cycles. This improved forecast accuracy while reducing manual intervention during month-end closes.Enable Insightful and Cost-Efficient Decision Making
Deliver executive dashboards and finance workbooks through Looker/Omni with drill-downs into cash drivers like payment terms, delinquency, and payroll timing. Optimized BigQuery design (partitioning, clustering, and cost guardrails) ensured high performance and predictable costs.
Business Problem
Cash forecasting and reconciliation were spreadsheet-driven, inconsistent, and manually intensive. Each finance team maintained its own version of forecast models, causing discrepancies, long refresh cycles, and delays in identifying variances between forecasted and actual cash positions.
Reconciling forecasts with actual cash movements at month-end required manual effort and was prone to errors — slowing the close process and reducing confidence in financial projections.
Constraints & Non-Functional Requirements
Aspect | Requirement |
|---|---|
Governance & Audit | Versioned models, reproducible runs, immutable month-end snapshots, SOX-friendly lineage |
Freshness & Latency | Daily refresh for core models, weekly forecast cycles, 95th percentile dashboard latency ≤ 3s |
Reliability | Idempotent incremental merges, deterministic recovery, audit tables, and safe backfills |
Security | IAM roles, RLS/ABAC for business units and regions, PII masking for vendors and employees |
Outcomes & Business Impact
Area | Impact |
|---|---|
Close Acceleration | Reduced manual spreadsheet effort, faster month-end close, fewer rework loops |
Forecast Accuracy | Backtesting models with measurable error bands; finance teams can tune assumptions |
Cash Predictability | Enhanced visibility into collections timing and cash pull-forward opportunities |
Single Source of Truth | Reconciled, board-ready cash views accessible to Finance, RevOps, and Leadership |
Tech Stack
Layer | Tools & Components |
|---|---|
Data Generation / Sources | AR/AP ledgers, invoices, POs, payroll, billing & collections, bank feeds, ERP exports |
Ingestion | Batch jobs (APIs/files) into BigQuery raw; late-data handling and audit columns |
Storage / Lakehouse | BigQuery datasets (raw → staging → EDW); snapshot tables for month-end close |
Orchestration | Dataform schedules with datagroups aligned to upstream arrivals |
Transformation / Modeling | Dataform models with contracts, incremental merges, and snapshots; cash ladder logic; aging and terms |
Serving / Consumption | Looker/Omni dashboards, finance workbooks, RLS/ABAC-based access by BU/region |
Governance & Security | IAM-scoped datasets, PII masking, audit logs, SOX-compliant lineage |
Observability / Quality | Reconciliation packs (GL tie-outs, variance checks), freshness monitors, backtesting dashboards |
DevOps / CI-CD | Git-managed Dataform repo; environment promotion via change tickets; version-controlled assumptions |
Cost / FinOps | Partitioned & clustered tables, bounded default ranges, slot/byte telemetry, cost guardrails |
Target Architecture (High-Level)
Flow:
Financial Sources → Ingestion & Staging (Raw → Staging in BigQuery) → Modeling (Dataform)
→ Forecast Engine (cash ladder logic across AR/AP/PO/Payroll + backtesting)
→ Serving & Access (Looker/Omni dashboards & finance workbooks, RLS-controlled)
Supporting Lanes:
Quality & Reconciliation, SLOs & Scheduling (datagroups), Security & Governance.
(Diagram Reference: “Dataform Cash-Forecast — Architecture (PNG)”)
Data Model & Semantics (Highlights)
Core Domains
AR: Invoices, credit memos, receipts
AP: Bills, payments
POs, Payroll, Bank Feeds
Core Entities
fact_invoice, fact_receipt, fact_bill, fact_payment, fact_po, fact_payroll, dim_customer, dim_vendor, fiscal_calendar
Forecast Semantics
Cash Ladder Logic: Projects expected cash dates using invoice terms, historical DSO patterns, and delinquency curves.
Aging Buckets: Categorized into 0–30, 31–60, 61–90, 90+ with cohort-based roll-forwards.
Forecast vs Actuals: Snapshot tables for each run, enabling backtesting and drift analysis.
Scenario Modeling: Conservative, baseline, and aggressive assumptions with shock factors for seasonality or policy changes.
Ops, Security, Quality & Performance
Category | Implementation |
|---|---|
Ops & Scheduling | Datagroups aligned with upstream data arrival; daily core refresh; weekly forecast recomputation; safe backfills |
Security & Access | IAM-scoped datasets; RLS/ABAC for BU/region; PII masking; query audit logs |
Quality & Reconciliation | Bronze/Silver/Gold parity checks; GL tie-outs; variance views for collections vs cash; freshness monitors |
Performance & Cost | Partitioned & clustered BigQuery tables; pre-aggregations for frequent queries; bounded date ranges; telemetry guardrails |
Deliverables
Dataform Repository – End-to-end SQL models (
raw → stg → edw) with contracts, snapshots, and incremental logicBacktesting Suite – Forecast vs. actual dashboards for model validation
Executive Dashboards (Looker/Omni) – Finance workbooks with drill-downs by region, BU, and term
Governance Artifacts – Data contracts, reconciliation pack, freshness monitors, and datagroup policies
Runbooks – Recovery, backfill, and change management procedures
Conclusion
By transitioning from spreadsheet-based forecasting to a governed Dataform-BigQuery engine, the client achieved faster closes, audit-ready lineage, and data-driven cash visibility.
Finance teams gained the ability to backtest assumptions, quantify forecast accuracy, and proactively manage liquidity.
Executives now rely on a single, reconciled view of cash, driving confident financial decisions and improved working capital efficiency.