alexa-amundson-resume/roles/15-data-engineer.md
Alexa Amundson d5c8667284 20 role-specific resumes with verified KPIs — BlackRoad only, no prior experience
2026-03-13 00:01:11 -05:00


Alexa Amundson

Data Engineer

amundsonalexa@gmail.com | github.com/blackboxprogramming


Summary

Data engineer building pipelines that collect, aggregate, and report 60+ KPIs daily across 9 data sources. Manages 283 databases (PostgreSQL, SQLite, D1, KV), FTS5 search indexes, and distributed tracing infrastructure across a 7-node fleet.


Experience

BlackRoad OS | Founder & Data Lead | 2024-Present

Data Pipelines

  • Built a 9-collector KPI pipeline drawing on the GitHub API, Gitea API, SSH fleet probes, Cloudflare API, and local system metrics
  • Daily aggregation of 60+ metrics into timestamped JSON snapshots
  • Day-over-day delta computation for trend tracking
  • Multi-format output: terminal report, Slack (block kit), markdown, raw JSON
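As an illustration of the day-over-day delta step, here is a minimal sketch. It assumes each daily snapshot is a flat JSON object mapping metric names to numbers; the function names and the metric names in the usage note are hypothetical, not taken from the actual pipeline.

```python
import json
from pathlib import Path


def load_snapshot(path):
    """Read one daily snapshot (assumed: a flat JSON object of metric -> number)."""
    return json.loads(Path(path).read_text())


def compute_deltas(today, yesterday):
    """Return {metric: (value, delta)}; delta is None without a numeric prior value."""
    deltas = {}
    for name, value in today.items():
        prev = yesterday.get(name)
        if isinstance(value, (int, float)) and isinstance(prev, (int, float)):
            deltas[name] = (value, value - prev)
        else:
            deltas[name] = (value, None)
    return deltas


def format_delta(delta):
    """Render a delta as a short indicator for a report line."""
    if delta is None:
        return "n/a"
    if delta == 0:
        return "="
    return f"{delta:+}"  # explicit sign, e.g. +4 or -2
```

For example, `compute_deltas({"repos": 354}, {"repos": 350})` yields `{"repos": (354, 4)}`, which `format_delta` renders as `+4`. Metrics that are new today get `n/a` instead of a misleading zero.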

Data Collection

  • GitHub collector: commits, PRs, events, languages, stars, forks via gh CLI API
  • Fleet collector: Python probes piped over SSH to each node (CPU, RAM, disk, Docker, Ollama)
  • Services collector: deep inspection of Ollama, Docker, PostgreSQL, Nginx, systemd per node
  • Autonomy collector: self-healing events, cron jobs, timers, service restarts
  • Cloudflare collector: D1 databases, KV namespaces, R2 buckets, Pages projects
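The fleet collector's "Python probes piped over SSH" pattern might look roughly like the sketch below. The probe contents, host names, and helper names are assumptions made for illustration; the real probes also inspect RAM, Docker, and Ollama.

```python
import json
import subprocess

# Probe source sent to the remote interpreter on stdin; stdlib only,
# so nothing needs to be installed on the node (hypothetical example probe).
PROBE = r'''
import json, os, shutil
disk = shutil.disk_usage("/")
print(json.dumps({
    "load_1m": os.getloadavg()[0],
    "disk_used_pct": round(100 * disk.used / disk.total, 1),
}))
'''


def ssh_command(host):
    """Build the ssh invocation; 'python3 -' reads the script from stdin."""
    return ["ssh", "-o", "BatchMode=yes", "-o", "ConnectTimeout=5",
            host, "python3", "-"]


def parse_probe(stdout):
    """Parse the last non-empty line as JSON so login banners are ignored."""
    lines = [ln for ln in stdout.splitlines() if ln.strip()]
    return json.loads(lines[-1])


def probe_node(host, timeout=15):
    """Run the probe on one node and return its metrics as a dict."""
    result = subprocess.run(
        ssh_command(host), input=PROBE, capture_output=True,
        text=True, timeout=timeout, check=True,
    )
    return parse_probe(result.stdout)
```

Piping the script over stdin keeps the nodes stateless: there is no agent to deploy or version, at the cost of one SSH round trip per node per run.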

Data Storage

  • 11 PostgreSQL databases across fleet
  • 230 SQLite databases (1.4 GB) with 111 registered systems
  • 22 D1 databases (40 MB) for serverless applications
  • 46 KV namespaces for edge state
  • FTS5 index across 354 repos and 156,675 memory entries
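For readers unfamiliar with FTS5, the following is a small sketch of a full-text index using Python's built-in `sqlite3` module. The table name `memory` and its columns are hypothetical, and it assumes the local SQLite build has FTS5 enabled (most do, but not all).

```python
import sqlite3


def build_index(entries):
    """Create an in-memory FTS5 table and load (repo, body) rows into it."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE VIRTUAL TABLE memory USING fts5(repo, body)")
    conn.executemany("INSERT INTO memory (repo, body) VALUES (?, ?)", entries)
    return conn


def search(conn, query, limit=5):
    """Return matching rows, best match first (FTS5's built-in 'rank')."""
    rows = conn.execute(
        "SELECT repo, body FROM memory WHERE memory MATCH ? "
        "ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()
```

A call such as `search(conn, "probe")` returns only the rows whose text contains the token `probe`, ordered by relevance.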

Data Quality

  • All KPI metrics machine-verified from live sources
  • Automated collection via cron (6 AM daily) + GitHub Actions
  • Snapshot retention: daily JSON files with full source data
  • Distributed tracing with nanosecond-precision spans
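One way to record spans with nanosecond-resolution timestamps is sketched below. The record fields and the in-memory `SPANS` list are assumptions for illustration, not the actual tracing schema; note that nanosecond resolution of the stored values does not imply nanosecond accuracy of the underlying clock.

```python
import time
import uuid
from contextlib import contextmanager

SPANS = []  # finished spans; a real system would export these instead


@contextmanager
def span(name, trace_id=None, parent_id=None):
    """Time a block of work and record it as a span."""
    record = {
        "trace_id": trace_id or uuid.uuid4().hex,
        "span_id": uuid.uuid4().hex[:16],
        "parent_id": parent_id,
        "name": name,
        "start_unix_ns": time.time_ns(),  # wall-clock start, in nanoseconds
    }
    started = time.perf_counter_ns()  # monotonic clock for the duration
    try:
        yield record
    finally:
        record["duration_ns"] = time.perf_counter_ns() - started
        SPANS.append(record)
```

Nested work passes the parent's `trace_id` and `span_id` down, for example `with span("collect") as root:` followed by `with span("github", root["trace_id"], root["span_id"]):`, so spans from different nodes can later be stitched into one trace.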

Reporting

  • Terminal dashboard with color-coded metrics and progress bars
  • Slack webhook integration with block kit formatting
  • Markdown report for GitHub README auto-update
  • Historical comparison with delta indicators
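The Slack output can be sketched as a Block Kit payload posted to an incoming webhook using only `json` and `urllib`. The function names and report title are hypothetical; the chunking reflects Slack's limit of 10 fields per section block.

```python
import json
import urllib.request


def build_blocks(title, metrics):
    """Build a Block Kit payload: one header plus sections of up to 10 fields."""
    fields = [
        {"type": "mrkdwn", "text": f"*{name}*\n{value}"}
        for name, value in metrics.items()
    ]
    blocks = [{"type": "header", "text": {"type": "plain_text", "text": title}}]
    for i in range(0, len(fields), 10):  # Slack allows at most 10 fields per section
        blocks.append({"type": "section", "fields": fields[i:i + 10]})
    return {"blocks": blocks}


def post_to_slack(webhook_url, payload):
    """POST the payload to an incoming-webhook URL; returns the HTTP status."""
    req = urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=10) as resp:
        return resp.status
```

Keeping payload construction separate from the network call means the same metrics dictionary can feed the terminal, markdown, and JSON outputs without touching Slack.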

Technical Skills

  • Pipelines: Bash, Python, SSH, cron, GitHub Actions
  • Databases: PostgreSQL, SQLite/FTS5, Cloudflare D1, KV
  • APIs: GitHub API, Cloudflare API, Gitea API, Ollama API
  • Formats: JSON, Markdown, Slack Block Kit
  • Tools: gh CLI, Wrangler, Python (json, urllib), Bash


Metrics

| Metric | Value |
|---|---|
| Daily KPIs tracked | 60+ |
| Data collectors | 9 |
| Databases managed | 283 |
| Data pipeline runs | Daily automated |
| Repos indexed | 354 |
| Memory entries | 156,675 |
| Report formats | 4 |