Skip to content
View Quazmoz's full-sized avatar
🎏
Focusing
🎏
Focusing

Block or report Quazmoz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Quazmoz/README.md

Hi, I'm Quinn πŸ‘‹

Senior Automation Engineer building AI + DevOps systems, local LLM infrastructure, MCP workflows, and focused utility products.

I turn operational friction into software: self-hosted AI control planes, automation platforms, local inference stacks, and small apps that solve specific real-world problems.

Portfolio Consulting LinkedIn YouTube


What I Build

I work at the intersection of automation engineering, AI infrastructure, DevOps platforms, and small utility products.

  • AI agent infrastructure β€” memory layers, MCP servers, prompt/skill systems, local-agent workflows
  • Local/private LLM stacks β€” OpenAI-compatible APIs, OpenVINO, Windows AI hardware, GPU/NPU experimentation
  • DevOps automation platforms β€” Terraform, Ansible, Kubernetes, CI/CD, observability, internal tooling
  • Operator-first apps β€” focused Android / Wear OS tools with privacy-first architecture and Play Store readiness
  • Consulting/product systems β€” portfolio, technical content, and practical AI infrastructure offers

Build the tool once. Remove the task forever.


Featured Public Repositories

These public repos best represent my current direction.

Project What it is Stack / Focus Status
MemoryOps Self-hosted memory control plane for AI agents Rust, Postgres, Redis, Qdrant, MCP, control UI Alpha
OpenVINO Windows LLM Local OpenAI-compatible LLM server for Intel Windows PCs Python, OpenVINO GenAI, Windows, CPU/GPU/NPU, web UI Working
AgentDefaults Reusable defaults for AI agents, skills, prompts, wrappers, and token-efficiency workflows Markdown, validation scripts, AI agent UX Active
npu-windows Earlier Intel NPU local LLM experiment and predecessor to the OpenVINO stack Windows, Intel NPU, local AI Legacy / reference

Flagship Work

MemoryOps β€” AI Agent Memory Control Plane

MemoryOps is my self-hosted memory layer for AI agents. It is designed for teams that need governed, inspectable, token-aware context retrieval across engineering activity instead of one-off prompt stuffing or unmanaged vector search.

Focus areas:

  • Structured ingestion from engineering systems and agent observations
  • Episodic β†’ semantic memory lifecycle management
  • Hybrid retrieval, feedback, decay, pinning, pruning, and auditability
  • MCP/API access so coding agents can retrieve and store memory directly
  • Operator UI for visibility into what agents remember and why

OpenVINO Windows LLM β€” Local AI Workstation Stack

OpenVINO Windows LLM turns Intel Windows PCs into practical local AI workstations. It wraps OpenVINO GenAI in an OpenAI-compatible server with streaming chat, model lifecycle management, a built-in browser UI, conversion helpers, and direct targeting for Intel CPU, GPU, NPU, and AUTO devices.

Use cases:

  • Open WebUI / n8n / LangChain / Continue integration
  • Local-first AI experimentation without cloud APIs
  • Intel NPU/GPU/CPU device routing tests
  • Laptop-friendly model conversion and serving workflows

AgentDefaults β€” Reusable AI Engineering Defaults

AgentDefaults is a practical library of reusable agent profiles, skills, prompts, wrappers, examples, and validation patterns. It is built around a simple idea: AI engineering improves when default behavior is reusable, testable, and benchmarkable.

It includes:

  • Repo-aware coding-agent defaults
  • Claude / Gemini / Copilot / editor wrappers
  • Token-efficiency skills and measurement prompts
  • Reusable task profiles for DevOps, research, SEO, coding, and documentation
  • Validation scripts for keeping prompt libraries maintainable

Private / Product-Stage Work

Not everything I build is public. A lot of my most active work lives in private repos because it includes product-stage code, app-store release details, security-sensitive automation, or client-style infrastructure patterns.

Current private/product areas include:

Area What it demonstrates
Wear OS utility apps Kotlin, Jetpack Compose, Play Billing, tiles, complications, haptics, Android release engineering
WebhookDeck Secure wrist-based webhook triggers for operators, homelab users, and automation workflows
WristSense / WristNote / WristDash / WristLux Sensor dashboards, voice notes, Home Assistant controls, light metering, phone-watch sync
FidgetDrop Privacy-first Android haptic product with one-time Pro unlock and local-only state
InfraOrbit Local-first AI command center for DevOps and platform engineering teams
Consulting site AI infrastructure consulting hub, technical articles, videos, app pages, legal/privacy docs

The public repos are the visible slice. The private repos are where the productization, release hardening, and app-store workflows happen.


Tech Stack

Layer Tools
Cloud & Infra Azure, AWS, OCI, Terraform, Ansible, Helm
Containers & Platforms Docker, Kubernetes, AKS, GitHub Actions, Azure DevOps, Jenkins
Languages PowerShell, Python, Go, Rust, TypeScript, Kotlin, Bash
AI / Agents MCP, FastMCP, local LLMs, OpenVINO, Open WebUI, n8n, Codex, Claude, Gemini, Perplexity
Backends Django REST Framework, FastAPI, Go HTTP services, Postgres, Redis, Qdrant
Frontends React, Vite, TypeScript, Flutter, Jetpack Compose, Compose for Wear OS
Observability Grafana, Prometheus, Sentry, structured logging, health checks
Mobile / Wearables Android, Wear OS, Play Billing, Data Layer sync, tiles, complications, haptics

Current Direction

  • Building self-hosted memory and context systems for AI agents
  • Making infrastructure more agent-accessible through MCP, APIs, and typed control surfaces
  • Running and evaluating local/private AI on Windows, GPU, and NPU hardware
  • Turning DevOps patterns into productized internal platforms
  • Shipping narrow, privacy-first utility apps instead of vague demos
  • Creating technical content around AI infrastructure, automation, and local LLM workflows

Elsewhere


Automation over ceremony. Utility over novelty. Systems that compound.

Pinned Loading

  1. npu-windows npu-windows Public

    Please see the newer: https://github.com/Quazmoz/openvino-windows-llm

    Python 21 6

  2. K8SHomelab K8SHomelab Public

    GitOps Kubernetes homelab with Flux CD, local LLM inference, MCP agent infrastructure, monitoring stack, and self-hosted tooling on bare-metal + Oracle Cloud

    Python 1

  3. groupme-mcp groupme-mcp Public

    Go

  4. memoryops memoryops Public

    Memory Operations Platform for AI agents β€” ingestion, lifecycle, token-aware retrieval, and control UI

    Rust 2

  5. openvino-windows-llm openvino-windows-llm Public

    Windows-first OpenAI-compatible local LLM server powered by OpenVINO GenAI for Intel CPU/GPU/NPU, with chat UI, model conversion, and setup scripts.

    Python 3 2

  6. agentdefaults agentdefaults Public

    Reusable AI agent defaults, prompts, skills, wrappers, and MCP video-editing workflows for coding, DevOps, token efficiency, and Palmier Pro automation.

    Python