All work
Data Monitoring Platforms · Mar 2025 – May 2026

DataQRL

Enterprise data governance platform — 7 modules, real-time integrity, entity resolution at scale.

Role
Co-founder, CPO & CTO
TypeScriptPythonNestJSFastAPISplinkPostgresNeo4jKafkaRedisDockerK8s
Problem

What was broken.

Enterprise data teams lose weeks reconciling records across systems. Existing MDM tools are heavyweight, opaque, and require expensive consultants. Mid-market companies need governance that scales with them — not against them.

Architecture

How it's wired.

Event-driven core. DataPulse streams change-data-capture into a validation engine (Sentinel). ReconFlow runs Splink-based probabilistic matching with hierarchical fidelity scoring. SignalWatch wraps anomaly detection on top. Cortex orchestrates AI agents over the lake; CaseHub gives investigators a cockpit; Data Studio handles remediation. All seven modules share a typed contract layer.

Build

What I shipped.

  • DataPulse™ — real-time integrity monitoring over CDC streams
  • ReconFlow™ — distributed entity resolution & data lineage (Splink + custom scoring)
  • SignalWatch™ — anomaly detection with adaptive thresholds
  • Cortex™ AI orchestrator + Sentinel validation engine
  • Data Studio remediation workstation & CaseHub investigative cockpit
  • Closed-loop verification processing millions of records
Outcomes

What changed.

7
Integrated modules shipped
M+
Records resolved per pipeline
0→1
Built and led from scratch
Tradeoffs

Why these choices.

Chose Splink over Zingg for explainability. Picked Neo4j for lineage graphs (vs JSON in Postgres) once depth > 4. Kept the agent layer behind a strict tool interface so swap from OpenAI to local models is one config change.