Files
alexa-amundson-resume/roles/15-data-engineer.md
Alexa Amundson ec7b1445b5 kpi: auto-update metrics 2026-03-13
RoadChain-SHA2048: c645c1292ab1555e
RoadChain-Identity: alexa@sovereign
RoadChain-Full: c645c1292ab1555ebe6982915536d1c94701ff6bb16c20ed6ef4144eb50c9f984b4bfe5b9902109e8defd958d6be43ced8ec11cf95d6241536cd4da0b75f8fb48cbeb1b9f450c8f665b73d39e837d23e73e2ba4201af4dc40c02a34283efb04b39c612083465536f194f16adfadb1b56f714a65b918f40750f54eebf7724236861de173ec31963ff3b1b988d712be7e5acc3fe391eb804d3fdcfb9ccf77afc732660d23fff801f894318327eabf775eb4f4e67f7f22d07f23b0e17f6594cfe95b83b275fb7baaa97115e86562604fc5b47cc8024574b61396924e0ee2b7e454b0a1480c3076c7ad72408ceb4a75360d2d49c7d805c37ac5315af00e4a8ca2262
2026-03-13 23:16:12 -05:00

2.2 KiB
Raw Blame History

Alexa Amundson

Data Engineer

amundsonalexa@gmail.com | github.com/blackboxprogramming


Summary

Needed to prove every metric on every resume. Built a 10-collector pipeline that pulls from GitHub API, SSH fleet probes, Cloudflare CLI, and local system — 80+ KPIs aggregated daily, pushed to KV, served live on 20 dashboards.


Experience

BlackRoad OS | Founder & Data Engineer | 2025Present

The Problem: Unverifiable Claims Don't Get Hired

  • Resumes say "managed 200+ services" but nobody can verify it. Needed machine-verified metrics with traceable sources
  • Built 10 automated collectors: GitHub, GitHub-deep, all-orgs, Gitea, fleet, services, autonomy, LOC, local, Cloudflare
  • Each collector runs independently, outputs JSON snapshots. Daily aggregation merges into a single file with 80+ keys. Every number has a source

The Pipeline: Collect \u2192 Aggregate \u2192 Serve

  • Fleet probes: Python scripts piped over SSH stdin to remote nodes — avoids shell quoting issues, runs on any node without installing anything
  • Cloudflare inventory: wrangler CLI queries Pages, D1, KV, R2 counts. GitHub API: paginated queries across 17 organizations, deduped
  • Daily JSON pushed to Cloudflare KV → Worker serves 20 live resume dashboards. Every number on this page updated automatically at 6 AM

The Scale: 283 Databases, One Pipeline

  • 283 databases across PostgreSQL, SQLite, D1, KV, Qdrant — each one discovered, counted, and tracked by the collectors
  • FTS5 full-text search across 156K entries. 111 registered systems. Day-over-day deltas show trends, not just snapshots

Technical Skills

Python, PostgreSQL, SQLite/FTS5, Cloudflare D1, data pipelines, SSH probes, JSON, Bash


Metrics

Metric Value Source
Lines of Code live loc.sh — cloc + fleet SSH
Total Repos live github-all-orgs.sh — gh api repos (17 owners)
PostgreSQL DBs live services.sh — psql -l via SSH
SQLite DBs live local.sh — find ~/.blackroad -name *.db
Total DB Rows live local.sh — sqlite3 row count across 230 DBs
D1 Databases live cloudflare.sh — wrangler d1 list --json