alexa-amundson-resume/roles/15-data-engineer.md
Alexa Amundson 292fa97a8e kpi: auto-update metrics 2026-03-13
2026-03-13 01:07:28 -05:00

# Alexa Amundson
**Data Engineer**

amundsonalexa@gmail.com | [github.com/blackboxprogramming](https://github.com/blackboxprogramming)

---
## Summary
Data engineer building pipelines that collect, aggregate, and report 60+ KPIs daily across 9 data sources. Manages 283 databases (PostgreSQL, SQLite, D1, KV), FTS5 search indexes, and distributed tracing infrastructure across a 7-node fleet.

---
## Experience
### BlackRoad OS | Founder & Data Lead | 2025–Present
**Data Pipelines**
- Built a 9-collector KPI pipeline with sources including the GitHub API, Gitea API, SSH fleet probes, Cloudflare API, and local system metrics
- Daily aggregation of 60+ metrics into timestamped JSON snapshots
- Day-over-day delta computation for trend tracking
- Multi-format output: terminal report, Slack (block kit), markdown, raw JSON
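
The day-over-day delta step above can be sketched roughly as follows. This is a minimal illustration, not the pipeline's actual code: the function name `compute_deltas`, the metric names, and the sample values are all hypothetical, and it assumes each snapshot is a flat JSON object of numeric metrics.

```python
import json


def compute_deltas(today: dict, yesterday: dict) -> dict:
    """Return {metric: change} for numeric metrics present in both snapshots."""
    deltas = {}
    for name, value in today.items():
        prev = yesterday.get(name)
        # Skip metrics that are new today or are not plain numbers.
        if isinstance(value, bool) or isinstance(prev, bool):
            continue
        if isinstance(value, (int, float)) and isinstance(prev, (int, float)):
            deltas[name] = value - prev
    return deltas


# Illustrative snapshots (assumed shape: one flat JSON object per day).
yesterday = json.loads('{"commits": 40, "repos": 350}')
today = json.loads('{"commits": 52, "repos": 354, "stars": 3}')

deltas = compute_deltas(today, yesterday)
print(deltas)
```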
**Data Collection**
- GitHub collector: commits, PRs, events, languages, stars, forks via `gh` CLI API
- Fleet collector: Python probes piped over SSH to each node (CPU, RAM, disk, Docker, Ollama)
- Services collector: deep inspection of Ollama, Docker, PostgreSQL, Nginx, systemd per node
- Autonomy collector: self-healing events, cron jobs, timers, service restarts
- Cloudflare collector: D1 databases, KV namespaces, R2 buckets, Pages projects
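
As a rough sketch of the "Python probes piped over SSH" pattern in the fleet collector bullet: a small script is sent to the remote interpreter on stdin and prints one JSON object back. The probe body, the function names, and the `runner` parameter are assumptions made for illustration; this probe reports only load average and disk usage and assumes a Unix-like node with `python3` on its PATH.

```python
import json
import subprocess

# Probe source sent to the remote interpreter on stdin (hypothetical metrics).
PROBE = """
import json, os, shutil
total, used, free = shutil.disk_usage("/")
print(json.dumps({
    "load1": os.getloadavg()[0],
    "disk_used_pct": round(100 * used / total, 1),
}))
"""


def ssh_command(host: str) -> list[str]:
    """Build the ssh invocation; `python3 -` reads the script from stdin."""
    return ["ssh", "-o", "BatchMode=yes", "-o", "ConnectTimeout=5",
            host, "python3", "-"]


def probe(host: str, runner=subprocess.run) -> dict:
    """Run the probe on `host` and parse the JSON it prints."""
    result = runner(ssh_command(host), input=PROBE, capture_output=True,
                    text=True, timeout=30, check=True)
    return json.loads(result.stdout)
```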
**Data Storage**
- 11 PostgreSQL databases across fleet
- 230 SQLite databases (1.4 GB) with 111 registered systems
- 22 D1 databases (40 MB) for serverless applications
- 46 KV namespaces for edge state
- FTS5 index across 354 repos and 156,675 memory entries
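
For readers unfamiliar with FTS5, a minimal sketch of a full-text index over repo entries follows. The table name, column names, and sample rows are hypothetical and say nothing about the real schema; it also assumes the local SQLite build has FTS5 enabled, which is common but not guaranteed.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# FTS5 virtual table; every declared column is full-text indexed.
conn.execute("CREATE VIRTUAL TABLE entries USING fts5(repo, body)")
conn.executemany(
    "INSERT INTO entries (repo, body) VALUES (?, ?)",
    [
        ("kpi-pipeline", "daily snapshot aggregation"),
        ("fleet-probes", "ssh probe for disk and cpu"),
    ],
)


def search(conn: sqlite3.Connection, query: str, limit: int = 10) -> list:
    """Return (repo, body) rows matching `query`, best match first."""
    rows = conn.execute(
        "SELECT repo, body FROM entries WHERE entries MATCH ? "
        "ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()


hits = search(conn, "snapshot")
print(hits)
```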
**Data Quality**
- All KPI metrics machine-verified from live sources
- Automated collection via cron (6 AM daily) + GitHub Actions
- Snapshot retention: daily JSON files with full source data
- Distributed tracing with nanosecond-precision spans
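
A nanosecond-precision span can be sketched as below. This is a single-process illustration only: the `span` helper, the record fields, and the in-memory `spans` list are assumptions, and a real distributed setup would also propagate trace IDs between nodes and export spans to a collector.

```python
import time
import uuid
from contextlib import contextmanager


@contextmanager
def span(name: str, sink: list):
    """Record one span with a wall-clock start and a monotonic duration."""
    record = {
        "name": name,
        "span_id": uuid.uuid4().hex[:16],
        "start_ns": time.time_ns(),  # wall-clock timestamp, nanoseconds
    }
    t0 = time.perf_counter_ns()  # monotonic clock, safe for durations
    try:
        yield record
    finally:
        record["duration_ns"] = time.perf_counter_ns() - t0
        sink.append(record)


spans = []
with span("collect.github", spans):
    sum(range(1000))  # stand-in for real collector work
```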
**Reporting**
- Terminal dashboard with color-coded metrics and progress bars
- Slack webhook integration with block kit formatting
- Markdown report for GitHub README auto-update
- Historical comparison with delta indicators
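
The Slack output could look roughly like the sketch below, which uses only `json` and `urllib` from the standard library. The function names, the report title, and the choice of a `header` block plus one `section` block are illustrative assumptions; the webhook URL must be supplied by the caller and nothing is posted when the snippet runs.

```python
import json
import urllib.request


def build_payload(title: str, metrics: dict) -> dict:
    """Build a Block Kit message: a header plus one mrkdwn section."""
    lines = "\n".join(f"*{name}:* {value}" for name, value in metrics.items())
    return {
        "blocks": [
            {"type": "header", "text": {"type": "plain_text", "text": title}},
            {"type": "section", "text": {"type": "mrkdwn", "text": lines}},
        ]
    }


def post_to_slack(webhook_url: str, payload: dict) -> int:
    """POST the payload to an incoming webhook and return the HTTP status."""
    req = urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=10) as resp:
        return resp.status


payload = build_payload("Daily KPIs", {"Repos indexed": 354, "Databases managed": 283})
print(json.dumps(payload, indent=2))
```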
---
## Technical Skills
**Pipelines:** Bash, Python, SSH, cron, GitHub Actions

**Databases:** PostgreSQL, SQLite/FTS5, Cloudflare D1, KV

**APIs:** GitHub API, Cloudflare API, Gitea API, Ollama API

**Formats:** JSON, Markdown, Slack Block Kit

**Tools:** gh CLI, Wrangler, Python (json, urllib), Bash

---
## Metrics
| Metric | Value |
|--------|-------|
| Daily KPIs tracked | 60+ |
| Data collectors | 9 |
| Databases managed | 283 |
| Data pipeline runs | Daily automated |
| Repos indexed | 354 |
| Memory entries | 156,675 |
| Report formats | 4 |