sync: Add Wrightstown Solar and Smart Home projects

New projects from 2026-02-09 research session:

Wrightstown Solar:
- DIY 48V LiFePO4 battery storage (EVE C40 cells)
- Victron MultiPlus II whole-house UPS design
- BMS comparison (Victron CAN bus compatible)
- EV salvage analysis (new cells won)
- Full parts list and budget

Wrightstown Smart Home:
- Home Assistant Yellow setup (local voice, no cloud)
- Local LLM server build guide (Ollama + RTX 4090)
- Hybrid LLM bridge (LiteLLM + Claude API + Grok API)
- Network security (VLAN architecture, PII sanitization)

Machine: ACG-M-L5090
Timestamp: 2026-02-09

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

projects/wrightstown-smarthome/PROJECT_INDEX.md (new file, 139 lines)
@@ -0,0 +1,139 @@

# Wrightstown Smart Home - Project Index

**Project:** Smart Home + Local LLM + Hybrid AI Bridge
**Location:** Wrightstown (home)
**Status:** Planning / Research Phase
**Created:** 2026-02-09
**Last Updated:** 2026-02-09

**Related Project:** `projects/wrightstown-solar/` (Victron integration crossover)

---

## Project Overview

Privacy-first smart home system using Home Assistant Yellow with local voice control, a dedicated local LLM server, and a hybrid routing bridge that keeps private data local while leveraging the Claude and Grok APIs for tasks requiring external search or heavy reasoning.

## Design Goals

1. No Google/Alexa dependency -- fully local voice control
2. Local LLM for private queries (home automation, sensor data, cameras)
3. Hybrid bridge to Claude API (reasoning) and Grok API (internet search)
4. Privacy-first: private data never leaves the network
5. Network segmentation (VLANs) for IoT isolation
6. Integration with Victron solar system (future crossover with wrightstown-solar project)

## Hardware Inventory

| Component | Status | Notes |
|---|---|---|
| Home Assistant Yellow | Owned (not set up) | Needs CM4/CM5 module |
| LLM Server | Not purchased | RTX 4090 build recommended |
| Network gear (VLANs) | TBD | Ubiquiti or TP-Link Omada |
| Wyoming voice satellites | Not purchased | For room-by-room voice control |

## Architecture

```
Voice Satellites (Wyoming)
          |
[Home Assistant Yellow]
    /     |     \
Zigbee  Victron  LLM Integration
devices Modbus   (Extended OpenAI Conversation)
          |
   [LiteLLM Proxy]
   (routing layer)
    /     |     \
[Ollama] [Claude] [Grok]
 Local  Anthropic  xAI
 $0/mo    API      API
Private Reasoning Search
```

---

## Files

### Documentation

| File | Description |
|---|---|
| `documentation/ha-yellow-setup.md` | Home Assistant Yellow setup and voice assistant config |
| `documentation/llm-server-build.md` | Local LLM server hardware, software, and model selection |
| `documentation/hybrid-bridge.md` | LiteLLM routing, Claude/Grok API integration, privacy layer |
| `documentation/network-security.md` | VLAN architecture, IoT isolation, PII sanitization |

### Session Logs

| File | Description |
|---|---|
| `session-logs/2026-02-09-session.md` | Initial research and planning session |

---

## Budget Estimate

### Phase 1: Home Assistant Yellow Setup

| Component | Est. Cost |
|---|---|
| RPi CM4 4GB/32GB (if not owned) | $45-75 |
| NVMe SSD (512GB-1TB) | $40-80 |
| **Phase 1 Total** | **$45-155** (Yellow already owned) |

### Phase 2: LLM Server (Sweet Spot Build)

| Component | Est. Cost |
|---|---|
| RTX 4090 24GB | $1,200-1,500 |
| Ryzen 7 5800X3D + B550 + 64GB DDR4 | $490 |
| Storage, PSU, case | $190 |
| **Phase 2 Total** | **$1,880-2,180** |

### Phase 3: Network + Voice

| Component | Est. Cost |
|---|---|
| VLAN-capable router + managed switch | $150-760 |
| Wyoming voice satellites (2-3 rooms) | $100-200 |
| **Phase 3 Total** | **$250-960** |

### Ongoing Monthly

| Item | Est. Cost |
|---|---|
| Cloud API (Claude + Grok, 80/15/5 split) | ~$5/mo |
| Electricity (LLM server) | ~$15-30/mo |
| **Monthly Total** | **~$20-35/mo** |

---

## Decision Log

| Date | Decision | Rationale |
|---|---|---|
| 2026-02-09 | Home Assistant over Google/Alexa | Privacy, local control, open source |
| 2026-02-09 | HA Yellow (already owned) | Built-in Zigbee, NVMe slot, still receives updates |
| 2026-02-09 | Ollama for local LLM | Simplest setup, OpenAI-compatible API, huge model library |
| 2026-02-09 | LiteLLM for hybrid routing | Unified API, cost tracking, fallback support |
| 2026-02-09 | Claude API for reasoning | Best quality reasoning, function calling for HA |
| 2026-02-09 | Grok API for search | Internet access, 2M token context, X integration |
| 2026-02-09 | Separate project from solar | Different build timelines, but Victron crossover planned |

---

## Implementation Roadmap

- [ ] **Phase 1 (Weeks 1-2):** Set up HA Yellow, basic automations, Zigbee devices
- [ ] **Phase 2 (Week 3):** Install Wyoming + Whisper + Piper for local voice
- [ ] **Phase 3 (Week 4):** Build LLM server, install Ollama, test models
- [ ] **Phase 4 (Week 5):** Set up LiteLLM proxy, add Claude + Grok API keys
- [ ] **Phase 5 (Week 6):** Integrate LLM with HA via Extended OpenAI Conversation
- [ ] **Phase 6 (Week 7):** Network segmentation (VLANs), PII sanitization
- [ ] **Phase 7 (Future):** Victron solar integration (crossover with wrightstown-solar)

---

## Crossover with Wrightstown Solar

When the Victron MultiPlus II + Cerbo GX are installed:

- Modbus TCP integration from the Cerbo into Home Assistant
- Energy dashboard with solar production + battery SOC
- Solar-aware automations ("run dishwasher when battery > 80%")
- LLM-powered energy analysis ("analyze my usage trends this week")
- See `projects/wrightstown-solar/` for the full solar system design

projects/wrightstown-smarthome/documentation/ha-yellow-setup.md (new file, 166 lines)
@@ -0,0 +1,166 @@

# Home Assistant Yellow - Setup Guide

**Created:** 2026-02-09
**Hardware:** Home Assistant Yellow (owned, not yet set up)

---

## What's in the Yellow

- Custom carrier board with Zigbee 3.0 radio (external antenna)
- Matter-ready architecture
- M.2 slot for NVMe SSD (add terabytes of storage)
- Power supply with international adapters (or PoE)
- Custom heatsink and thermal pad
- Ethernet cable

**Important:** Yellow production ended in October 2025, but it continues to receive full software updates. No action needed -- it's still fully supported.

**Required separately:** Raspberry Pi Compute Module 4 or 5
- Minimum: CM4 with 2GB RAM + 16GB eMMC
- Recommended: CM4 with 4GB RAM + 32GB eMMC
- CM4 Lite option requires an NVMe SSD (no eMMC)
- CM5 supported since HAOS 14 (November 2024)

---

## Setup Process

### Hardware Assembly

1. Install the CM4/CM5 -- align the CE mark at the bottom, press firmly onto the connectors
2. Apply the thermal pad to the SoC (silver square near the Raspberry icon)
3. Place the heatsink on the thermal pad
4. (Optional) Install an NVMe SSD in the M.2 slot
5. Close the enclosure

### First Boot

1. Connect the Ethernet cable to the router
2. Connect the power supply
3. Wait 5-10 minutes for initial setup
4. Access Home Assistant at `http://homeassistant.local:8123`
5. Follow the onboarding wizard:
   - Create admin account
   - Set location and timezone
   - Auto-discover devices on the network

### Post-Setup Essentials

1. **Install HACS** (Home Assistant Community Store) -- for custom integrations
2. **Configure Zigbee** -- the ZHA integration auto-detects the built-in radio
3. **Set up backups** -- automatic daily backups to NAS or cloud
4. **Install File Editor** -- for editing YAML configs in the browser

---

## Local Voice Assistant (Replace Alexa/Google)

### Core Components (All Local, No Cloud)

**Whisper** -- speech-to-text (OpenAI's engine, runs locally)
- Processing: ~8 seconds per transcription on an RPi 4
- Languages: English primary, expanding multilingual support

**Piper** -- text-to-speech (neural TTS, optimized for RPi)
- Multiple voice options
- Very fast synthesis
- Natural-sounding output

**Wyoming Protocol** -- communication standard between voice services
- Connects Whisper, Piper, and wake word detection
- Supports remote satellites

**openWakeWord** -- wake word detection
- "Hey Jarvis", "Alexa" (custom), "OK Nabu"
- Custom wake words trainable
- English-only currently

### Setup Steps

1. Go to Settings > Add-ons
2. Install the "Whisper" add-on > Start
3. Install the "Piper" add-on > Start
4. Install the "openWakeWord" add-on > Start
5. Go to Settings > Voice Assistants
6. Create a new assistant:
   - Speech-to-text: Whisper
   - Text-to-speech: Piper
   - Wake word: openWakeWord
   - Conversation agent: Home Assistant default (or Ollama later)

### Voice Satellites (Room-by-Room Control)

**Option 1: Wyoming Satellite (RPi-based)**
- Raspberry Pi Zero 2 W + microphone + speaker
- ~$30-50 per room
- Full wake word + voice processing (bring-up sketched below)

**Option 2: ESPHome Voice Satellite (ESP32-based)**
- ESP32-S3 + I2S microphone + speaker
- ~$15-25 per room
- Auto-discovery via Zeroconf
- Supports media player, timers, conversations
- Requires ESPHome 2025.5.0 or later
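
For option 1, a minimal bring-up sketch using the `wyoming-satellite` project; the flags follow that project's README, but the satellite name, port, and ALSA record/playback commands are placeholders to adjust for your mic and speaker hardware:

```bash
# From a clone of https://github.com/rhasspy/wyoming-satellite on the Pi
script/run \
  --name 'living-room-satellite' \
  --uri 'tcp://0.0.0.0:10700' \
  --mic-command 'arecord -r 16000 -c 1 -f S16_LE -t raw' \
  --snd-command 'aplay -r 22050 -c 1 -f S16_LE -t raw'
```

Home Assistant then discovers the satellite through the Wyoming integration (add it with the satellite's IP and port 10700).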

---

## Key Integrations

### Zigbee (Built-in Radio)

The Yellow has a Silicon Labs Zigbee radio. Use the ZHA (Zigbee Home Automation) integration:
- Auto-detected on first boot
- Supports 100+ Zigbee device brands
- No additional dongle needed

**Recommended Zigbee devices:**
- Lights: IKEA Tradfri, Philips Hue (Zigbee mode)
- Sensors: Aqara door/window, motion, temperature
- Switches: Sonoff SNZB series
- Plugs: Innr, IKEA, Sonoff

### Victron Solar (Future Crossover)

When the Victron MultiPlus II + Cerbo GX are installed:

**Integration methods:**
- **Modbus TCP** (recommended): direct polling via the `hass-victron` custom integration
- **MQTT**: bidirectional via the Cerbo GX
- **VRM API**: cloud-based (60-minute refresh, built in since HA 2025.10)

**Best practice:** Use a 5-second Modbus scan interval (not 20 seconds)

**Energy automations** (a YAML sketch follows below):
- Run high-load devices when battery SOC > 75%
- Trigger charging at 25%, stop at 50%
- EMHASS (Energy Management for Home Assistant) for optimal solar usage
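
As a sketch of what a solar-aware automation could look like once the Victron entities exist in HA; the entity IDs here are hypothetical placeholders, not names from an actual install:

```yaml
# automations.yaml sketch -- hypothetical entity IDs
- alias: "Run dishwasher on surplus solar"
  trigger:
    - platform: numeric_state
      entity_id: sensor.victron_battery_soc      # assumed Modbus SOC sensor
      above: 75
  condition:
    - condition: state
      entity_id: binary_sensor.dishwasher_loaded # hypothetical "ready to run" flag
      state: "on"
  action:
    - service: switch.turn_on
      target:
        entity_id: switch.dishwasher_plug        # smart plug on the IoT VLAN
```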

### Ollama / Local LLM

Native Ollama integration (HA 2025.6+):
1. Settings > Devices & Services > Add Integration > "Ollama"
2. Enter the Ollama server URL: `http://<llm-server-ip>:11434`
3. Settings > Voice Assistants > select "Ollama" as the conversation agent

**Extended OpenAI Conversation** (HACS custom component):
- Supports OpenAI-compatible APIs (Ollama, LiteLLM)
- Function calling for HA service control
- Create automations, query APIs, control devices via natural language
- Install via a HACS custom repository

**Multi-model strategy:**
- Agent 1: Qwen 2.5 7B for real-time voice commands (fast, <1s response)
- Agent 2: Llama 3.1 70B for complex reasoning (slower, smarter)
- Route via the LiteLLM proxy

---

## Sources

- https://www.home-assistant.io/yellow/
- https://yellow.home-assistant.io/guides/install-cm4/
- https://www.home-assistant.io/integrations/wyoming/
- https://www.home-assistant.io/voice_control/voice_remote_local_assistant/
- https://www.home-assistant.io/integrations/ollama/
- https://github.com/jekalmin/extended_openai_conversation

projects/wrightstown-smarthome/documentation/hybrid-bridge.md (new file, 290 lines)
@@ -0,0 +1,290 @@

# Hybrid LLM Bridge - Local + Cloud Routing

**Created:** 2026-02-09
**Purpose:** Route queries intelligently between local Ollama, the Claude API, and the Grok API

---

## Architecture

```
User Query (voice, chat, HA automation)
                |
         [LiteLLM Proxy]
         localhost:4000
                |
        Routing Decision
         /      |      \
  [Ollama]  [Claude]  [Grok]
   Local   Anthropic   xAI
   Free    Reasoning  Search
  Private  $3/$15/1M  $3/$15/1M
```

---

## Recommended: LiteLLM Proxy

A unified API gateway that presents a single OpenAI-compatible endpoint. Everything talks to `localhost:4000`, and LiteLLM routes to the right backend.

### Installation

```bash
pip install litellm[proxy]
```

### Configuration (`config.yaml`)

```yaml
model_list:
  # Local models (free, private)
  - model_name: local-fast
    litellm_params:
      model: ollama/qwen2.5:7b
      api_base: http://localhost:11434

  - model_name: local-reasoning
    litellm_params:
      model: ollama/llama3.1:70b-q4
      api_base: http://localhost:11434

  # Cloud: Claude (complex reasoning)
  - model_name: cloud-reasoning
    litellm_params:
      model: claude-sonnet-4-5-20250929
      api_key: sk-ant-XXXXX

  - model_name: cloud-reasoning-cheap
    litellm_params:
      model: claude-haiku-4-5-20251001
      api_key: sk-ant-XXXXX

  # Cloud: Grok (internet search)
  - model_name: cloud-search
    litellm_params:
      model: grok-4
      api_key: xai-XXXXX
      api_base: https://api.x.ai/v1

router_settings:
  routing_strategy: simple-shuffle
  allowed_fails: 2
  num_retries: 3

# Budget caps shown for intent; check the LiteLLM docs for the current budget syntax
budget_policy:
  local-fast: unlimited
  local-reasoning: unlimited
  cloud-reasoning: $50/month
  cloud-reasoning-cheap: $25/month
  cloud-search: $25/month
```

### Start the Proxy

```bash
litellm --config config.yaml --port 4000
```

### Usage

Everything talks to `http://localhost:4000` using the OpenAI-compatible format:

```python
import openai

client = openai.OpenAI(
    api_key="anything",  # LiteLLM doesn't need this for local
    base_url="http://localhost:4000"
)

# Route to local
response = client.chat.completions.create(
    model="local-fast",
    messages=[{"role": "user", "content": "Turn on the lights"}]
)

# Route to Claude
response = client.chat.completions.create(
    model="cloud-reasoning",
    messages=[{"role": "user", "content": "Analyze my energy usage patterns"}]
)

# Route to Grok
response = client.chat.completions.create(
    model="cloud-search",
    messages=[{"role": "user", "content": "What's the current electricity rate in PA?"}]
)
```

---

## Routing Strategy

### What Goes Where

**Local (Ollama) -- default for everything private:**
- Home automation commands ("turn on lights", "set thermostat to 72")
- Sensor data queries ("what's the temperature in the garage?")
- Camera-related queries (never send video to the cloud)
- Personal information queries
- Simple Q&A
- Quick lookups from local knowledge

**Claude API -- complex reasoning tasks:**
- Detailed analysis ("analyze my energy trends this month")
- Code generation ("write an HA automation for...")
- Long-form content creation
- Multi-step reasoning problems
- Function calling for HA service control

**Grok API -- internet/real-time data:**
- Current events ("latest news on solar tariffs")
- Real-time pricing ("current electricity rates")
- Weather data (if not using a local integration)
- Web searches
- Anything requiring information the local model doesn't have

### Manual vs. Automatic Routing

**Phase 1 (start here):** Manual model selection
- The user picks "local-fast", "cloud-reasoning", or "cloud-search" in Open WebUI
- Simple, no mistakes, full control
- Good for learning which queries work best where

**Phase 2 (later):** Keyword-based routing in front of LiteLLM (sketched below)
- Route based on keywords in the query
- "search", "latest", "current" --> Grok
- "analyze", "explain in detail", "write code" --> Claude
- Everything else --> local
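
A minimal pre-router sketch of the Phase 2 idea: classify the query by keyword, then call the LiteLLM proxy with the chosen model name. The model names match the `config.yaml` above; the keyword lists are illustrative, not tuned:

```python
import openai

client = openai.OpenAI(api_key="anything", base_url="http://localhost:4000")

SEARCH_WORDS = ("search", "latest", "current", "news")
REASONING_WORDS = ("analyze", "explain in detail", "write code")

def route(query: str) -> str:
    """Pick a LiteLLM model name by simple keyword matching."""
    q = query.lower()
    if any(w in q for w in SEARCH_WORDS):
        return "cloud-search"
    if any(w in q for w in REASONING_WORDS):
        return "cloud-reasoning"
    return "local-fast"  # default: keep it local and private

def ask(query: str) -> str:
    response = client.chat.completions.create(
        model=route(query),
        messages=[{"role": "user", "content": query}],
    )
    return response.choices[0].message.content
```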

**Phase 3 (advanced):** Semantic routing (sketched below)
- Use sentence embeddings to classify query intent
- A small local model (all-MiniLM-L6-v2) classifies in 50-200ms
- The most intelligent routing, but requires some Python development
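
A sketch of the Phase 3 approach with `sentence-transformers`; the routes and sample utterances are made up for illustration, and a library like `semantic-router` packages the same idea:

```python
from sentence_transformers import SentenceTransformer, util

encoder = SentenceTransformer("all-MiniLM-L6-v2")  # small, fast, runs locally

# A few example utterances per route; embeddings are computed once at startup
ROUTES = {
    "cloud-search": ["what's the latest news", "current electricity rates"],
    "cloud-reasoning": ["analyze my energy trends", "write an automation for me"],
    "local-fast": ["turn on the lights", "what's the garage temperature"],
}
route_names, examples = zip(*[(n, e) for n, exs in ROUTES.items() for e in exs])
example_embs = encoder.encode(list(examples), convert_to_tensor=True)

def route(query: str) -> str:
    """Return the route whose example utterance is most similar to the query."""
    q_emb = encoder.encode(query, convert_to_tensor=True)
    best = util.cos_sim(q_emb, example_embs).argmax().item()
    return route_names[best]
```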

---

## Cloud API Details

### Claude (Anthropic)

**Endpoint:** `https://api.anthropic.com/v1/messages`
**Get API key:** https://console.anthropic.com/

**Pricing (2025-2026):**

| Model | Input/1M tokens | Output/1M tokens | Best For |
|---|---|---|---|
| Claude Haiku 4.5 | $1.00 | $5.00 | Fast, cheap tasks |
| Claude Sonnet 4.5 | $3.00 | $15.00 | Best balance |
| Claude Opus 4.5 | $5.00 | $25.00 | Top quality |

**Cost optimization:**
- Prompt caching: 90% savings on repeated system prompts
- Use Haiku for simple tasks, Sonnet for complex ones
- Batch processing available for non-urgent tasks
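
A sketch of prompt caching with the Anthropic SDK: marking the long, repeated system prompt with `cache_control` lets subsequent calls reuse it at the cached rate. The system prompt content here is a stand-in:

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

HA_SYSTEM_PROMPT = "You are a home automation agent. <long device/entity list here>"

response = client.messages.create(
    model="claude-haiku-4-5-20251001",
    max_tokens=256,
    system=[{
        "type": "text",
        "text": HA_SYSTEM_PROMPT,
        "cache_control": {"type": "ephemeral"},  # cache the repeated prefix
    }],
    messages=[{"role": "user", "content": "Draft an automation for sunset lights"}],
)
print(response.content[0].text)
```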

**Features:**
- 200k context window
- Extended thinking mode
- Function calling (ideal for HA control)
- Vision support (could analyze charts, screenshots)

### Grok (xAI)

**Endpoint:** `https://api.x.ai/v1/chat/completions`
**Get API key:** https://console.x.ai/
**Format:** OpenAI SDK compatible

**Pricing:**

| Model | Input/1M tokens | Output/1M tokens | Best For |
|---|---|---|---|
| Grok 4.1 Fast | $0.20 | $1.00 | Budget queries |
| Grok 4 | $3.00 | $15.00 | Full capability |

**Free credits:** $25 for new users, plus $150/month for opting into the data sharing program

**Features:**
- 2 million token context window (industry-leading)
- Real-time X (Twitter) integration
- Internet search capability
- OpenAI SDK compatibility

---

## Monthly Cost Estimates

### Conservative Use (80/15/5 Split, 1,000 queries/month)

| Route | Queries | Model | Cost |
|---|---|---|---|
| Local (80%) | 800 | Ollama | $0 |
| Claude (15%) | 150 | Haiku 4.5 | ~$0.90 |
| Grok (5%) | 50 | Grok 4.1 Fast | ~$0.07 |
| **Total** | **1,000** | | **~$1/month** |

### Heavy Use (60/25/15 Split, 3,000 queries/month)

| Route | Queries | Model | Cost |
|---|---|---|---|
| Local (60%) | 1,800 | Ollama | $0 |
| Claude (25%) | 750 | Sonnet 4.5 | ~$15 |
| Grok (15%) | 450 | Grok 4 | ~$9 |
| **Total** | **3,000** | | **~$24/month** |

**Add electricity for the LLM server:** ~$15-30/month (RTX 4090 build)

---

## Home Assistant Integration

### Connect HA to the LiteLLM Proxy

**Option 1: Extended OpenAI Conversation (recommended)**

Install via HACS, then configure:
- API Base URL: `http://<llm-server-ip>:4000/v1`
- API Key: any string (LiteLLM doesn't validate it for local use)
- Model: `local-fast` (or any model name from your config)

This gives HA natural-language control:
- "Turn off all lights downstairs" --> local LLM understands --> calls the HA service
- "What's my battery charge level?" --> queries HA entities --> responds

**Option 2: Native Ollama Integration**

Settings > Integrations > Ollama:
- URL: `http://<llm-server-ip>:11434`
- Simpler, but bypasses the routing layer

### Voice Assistant Pipeline

```
Wake word detected ("Hey Jarvis")
        |
Whisper (speech-to-text, local)
        |
    Query text
        |
Extended OpenAI Conversation
        |
LiteLLM Proxy (routing)
        |
   Response text
        |
Piper (text-to-speech, local)
        |
  Speaker output
```

---

## Sources

- https://docs.litellm.ai/
- https://github.com/open-webui/open-webui
- https://console.anthropic.com/
- https://docs.x.ai/developers/models
- https://github.com/jekalmin/extended_openai_conversation
- https://github.com/aurelio-labs/semantic-router

projects/wrightstown-smarthome/documentation/llm-server-build.md (new file, 192 lines)
@@ -0,0 +1,192 @@

# Local LLM Server - Build Guide

**Created:** 2026-02-09
**Purpose:** Dedicated local LLM inference server for smart home + general use

---

## Recommended Build: RTX 4090 (Sweet Spot)

| Component | Spec | Est. Cost |
|---|---|---|
| GPU | NVIDIA RTX 4090 24GB | $1,200-1,500 |
| CPU | AMD Ryzen 7 5800X3D | $250 |
| Motherboard | B550 | $120 |
| RAM | 64GB DDR4 (2x32GB) | $120 |
| Storage | 1TB NVMe Gen4 | $80 |
| PSU | 850W Gold | $100 |
| Case | Mid-tower, good airflow | $70 |
| **Total** | | **$1,940-2,240** |

### Why This Build

- 24GB VRAM runs 70B-parameter models at 4-bit quantization with partial CPU offload (7-9 tok/s)
- 30B models at 8-bit quality (20+ tok/s)
- 7-8B models at full speed (30+ tok/s)
- Handles everything from quick voice commands to serious reasoning
- A single GPU keeps complexity low

### Alternative Builds

**Budget (~$580):**
- RTX 3060 12GB (used, $250)
- Ryzen 5 5600X + B450 + 32GB DDR4
- Runs 7B models great, 13B quantized
- Fine for voice commands and basic queries

**Flagship (~$4,000-5,400):**
- RTX 5090 32GB ($2,500-3,800)
- Ryzen 9 7950X + X670E + 128GB DDR5
- 70B models at high quality (20+ tok/s)
- ~67% faster than the 4090
- Future-proof for larger models

**Efficiency (Mac Mini M4 Pro, $1,399+):**
- 24-64GB unified memory
- ~20W power draw vs. ~450W for a 4090 under load
- Great for an always-on server
- Qwen 2.5 32B at 11-12 tok/s on the 64GB config

---

## Software Stack

### Primary: Ollama

```bash
# Install (Linux)
curl -fsSL https://ollama.com/install.sh | sh

# Install (Windows)
# Download from https://ollama.com/download

# Pull models
ollama pull qwen2.5:7b       # Fast voice commands
ollama pull llama3.1:8b      # General chat
ollama pull qwen2.5:32b      # Complex reasoning (needs 24GB+ VRAM)
ollama pull llama3.1:70b-q4  # Near-cloud quality (24GB VRAM + CPU offload)

# API runs on http://localhost:11434
```
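
A quick way to confirm the server is answering before wiring anything else up, using the Ollama REST API and one of the models pulled above:

```bash
curl http://localhost:11434/api/chat -d '{
  "model": "qwen2.5:7b",
  "messages": [{"role": "user", "content": "Say hello in five words"}],
  "stream": false
}'
```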

**Why Ollama:**
- One-command install
- Built-in model library
- OpenAI-compatible API (works with LiteLLM, Open WebUI, HA)
- Auto-optimized for your hardware
- Dead simple

### Web Interface: Open WebUI

```bash
docker run -d -p 3000:8080 \
  -v open-webui:/app/backend/data \
  -e OLLAMA_BASE_URL=http://localhost:11434 \
  ghcr.io/open-webui/open-webui:main
```

- ChatGPT-like interface at `http://<server-ip>:3000`
- Model selection dropdown
- Conversation history
- RAG (Retrieval-Augmented Generation) support
- User accounts

---

## Model Recommendations

### For Home Assistant Voice Commands (Speed Priority)

| Model | VRAM | Speed (4090) | Quality | Use Case |
|---|---|---|---|---|
| Qwen 2.5 7B | 4GB | 30+ tok/s | Good | Voice commands, HA control |
| Phi-3 Mini 3.8B | 2GB | 40+ tok/s | Decent | Ultra-fast simple queries |
| Llama 3.1 8B | 5GB | 30+ tok/s | Good | General chat |
| Gemma 2 9B | 6GB | 25+ tok/s | Good | Efficient general use |

### For Complex Reasoning (Quality Priority)

| Model | VRAM (Q4) | Speed (4090) | Quality | Use Case |
|---|---|---|---|---|
| Qwen 2.5 32B | 18GB | 15-20 tok/s | Very good | Detailed analysis |
| Llama 3.1 70B | 35GB | 7-9 tok/s | Excellent | Near-cloud reasoning |
| DeepSeek R1 | Varies | 10-15 tok/s | Excellent | Step-by-step reasoning |
| Mistral 22B | 12GB | 20+ tok/s | Good | Balanced speed/quality |

### For Code

| Model | Best For |
|---|---|
| DeepSeek Coder V2 | Debugging, refactoring |
| Qwen3 Coder | Agentic coding tasks |
| Llama 3.1 70B | General code + reasoning |

### Quantization Guide

4-bit quantization maintains roughly 95-98% of full quality at about 75% less VRAM. Most users can't tell the difference.

| Quantization | VRAM Use | Quality | When to Use |
|---|---|---|---|
| Q4_K_M | 25% of full | 95-98% | Default for most models |
| Q5_K_M | 31% of full | 97-99% | When you have spare VRAM |
| Q8_0 | 50% of full | 99%+ | When quality matters most |
| FP16 (full) | 100% | 100% | Only for small models (7B) |

---

## RAM Requirements

| Model Size | Minimum System RAM | Recommended | GPU VRAM (Q4) |
|---|---|---|---|
| 7B | 16GB | 32GB | 4-5GB |
| 13B | 32GB | 32GB | 8-9GB |
| 30B | 32GB | 64GB | 16-18GB |
| 70B | 64GB | 128GB | 35-40GB |

---

## Power and Noise Considerations

This is an always-on server in your home:

| GPU | TDP | Idle Power | Annual Electricity (24/7, idle) |
|---|---|---|---|
| RTX 3060 12GB | 170W | ~15W | ~$50-80/yr |
| RTX 4060 Ti 16GB | 165W | ~12W | ~$45-75/yr |
| RTX 4090 24GB | 450W | ~20W | ~$60-100/yr |
| RTX 5090 32GB | 575W | ~25W | ~$75-120/yr |

**Tips:**
- Configure the GPU to idle at low power when no inference is running
- Use Ollama's auto-unload, so models drop out of VRAM after an idle timeout (see the example below)
- Consider noise: a 4090 under load is not quiet. Use a case with good fans and put the server in a utility room or closet
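
For the auto-unload tip above, the idle timeout is Ollama's `keep_alive` setting; for example:

```bash
# Default idle timeout for all models (set in the service environment)
export OLLAMA_KEEP_ALIVE=5m

# Or unload one model from VRAM immediately via the API
curl http://localhost:11434/api/generate -d '{"model": "llama3.1:70b-q4", "keep_alive": 0}'
```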

---

## Server OS Recommendations

**Ubuntu Server 24.04 LTS** (recommended)
- Best NVIDIA driver support
- Native Docker support
- Easy Ollama install
- Headless (no GUI overhead)

**Windows 11** (if you want dual use)
- Ollama has native Windows support
- Docker Desktop for Open WebUI
- More overhead than Linux

**Proxmox** (if you want to run multiple VMs/containers)
- GPU passthrough to the LLM VM
- Run other services alongside
- More complex setup

---

## Sources

- https://localllm.in/blog/best-gpus-llm-inference-2025
- https://sanj.dev/post/affordable-ai-hardware-local-llms
- https://introl.com/blog/local-llm-hardware-pricing-guide-2025
- https://www.glukhov.org/post/2025/11/hosting-llms-ollama-localai-jan-lmstudio-vllm-comparison/
- https://huggingface.co/blog/daya-shankar/open-source-llms

projects/wrightstown-smarthome/documentation/network-security.md (new file, 210 lines)
@@ -0,0 +1,210 @@

# Network Security - VLAN Architecture & Privacy

**Created:** 2026-02-09
**Purpose:** IoT isolation, private data protection, PII sanitization for cloud APIs

---

## VLAN Architecture

```
+--------------------------------------------+
| VLAN 1: Trusted (192.168.1.0/24)           |
| Laptops, phones, tablets                   |
| Full internet access                       |
| Can initiate connections to all VLANs      |
+--------------------------------------------+
                     | (can access)
+--------------------------------------------+
| VLAN 10: Infrastructure (192.168.10.0/24)  |
| Home Assistant Yellow                      |
| LLM Server                                 |
| NAS (if applicable)                        |
| Can access Trusted + IoT VLANs             |
+--------------------------------------------+
                     | (can access)
+--------------------------------------------+
| VLAN 20: IoT (192.168.20.0/24)             |
| Zigbee coordinator (HA Yellow)             |
| WiFi cameras, sensors, smart plugs         |
| BLOCKED from initiating to Trusted         |
| Internet restricted (DNS/NTP only)         |
+--------------------------------------------+
                     | (restricted)
+--------------------------------------------+
| VLAN 99: Guest (192.168.99.0/24)           |
| Guest devices                              |
| Internet only, no internal access          |
+--------------------------------------------+
```

---

## Firewall Rules

### Rule 1: IoT Isolation (Critical)
- **ALLOW:** Trusted --> IoT (control devices from a phone)
- **BLOCK:** IoT --> Trusted (prevent a compromised device from probing the network)

### Rule 2: Infrastructure Bridge
- **ALLOW:** Infrastructure --> IoT (HA controls devices)
- **ALLOW:** Infrastructure --> Trusted (send notifications)
- **ALLOW:** Trusted --> Infrastructure (access the HA web UI, LLM)

### Rule 3: IoT Internet Restriction
- **ALLOW:** IoT --> DNS (port 53) and NTP (port 123)
- **BLOCK:** IoT --> internet (all other ports)
- **EXCEPTION:** Whitelist specific cloud services if a device requires them

### Rule 4: mDNS Control
- **BLOCK:** Broadcast protocols across VLANs by default
- **ALLOW:** Selective mDNS reflection for HA discovery
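
A sketch of rules 1-3 as an nftables forward-chain policy for a Linux-based router (OpenWRT or a pfSense/OPNsense box expresses the same thing in its own UI); the interface names `vlan1`, `vlan10`, `vlan20`, and `wan` are assumptions:

```
table inet vlan_fw {
    chain forward {
        type filter hook forward priority filter; policy drop;

        # Replies to already-established flows are always allowed
        ct state established,related accept

        # Rules 1 + 2: Trusted initiates anywhere; Infrastructure reaches Trusted, IoT, internet
        iifname "vlan1" accept
        iifname "vlan10" oifname { "vlan1", "vlan20", "wan" } accept

        # Rule 3: IoT gets DNS and NTP out, nothing else; everything unmatched hits the drop policy
        iifname "vlan20" oifname "wan" udp dport { 53, 123 } accept
        iifname "vlan20" oifname "wan" tcp dport 53 accept
    }
}
```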

---

## Hardware Options

### Budget: TP-Link Omada (~$150)
- ER605 router ($60) -- VLAN routing, firewall rules
- TL-SG2008P managed switch ($90) -- VLAN tagging, PoE

### Mid-tier: Ubiquiti UniFi (~$760)
- Dream Machine Pro ($379) -- router + controller + IDS
- USW-24-PoE switch ($379) -- 24 ports, VLAN, PoE
- Better UI, more features, IDS/IPS built in

### Existing Gear
- Most Netgear managed switches support VLANs
- OpenWRT on consumer routers adds VLAN capability
- pfSense/OPNsense on an old PC is free and powerful

---

## Privacy: Keeping Data Local

### Core Principle

**Private data NEVER leaves the local network.**

| Data Type | Route | Why |
|---|---|---|
| Sensor readings | Local LLM only | Reveals activity patterns |
| Camera feeds | Local LLM only | Obvious privacy concern |
| Device names/locations | Local LLM only | Reveals home layout |
| Presence detection | Local LLM only | Reveals who's home |
| Personal names/addresses | Strip before cloud | PII |
| Energy usage patterns | Sanitize before cloud | Activity inference |
| General questions | Cloud OK | No private data |
| Internet searches | Cloud OK (Grok) | No private data |

### PII Sanitization Pipeline

For queries that go to cloud APIs, scrub private information first:

```python
from presidio_analyzer import AnalyzerEngine
from presidio_anonymizer import AnonymizerEngine

analyzer = AnalyzerEngine()
anonymizer = AnonymizerEngine()

def sanitize_for_cloud(query):
    """Remove PII before sending to Claude/Grok; return None to force local handling."""

    # Hard block certain categories entirely (checked first, before any analysis)
    blocked_keywords = ["camera", "location", "address",
                        "password", "who is home", "alarm"]
    if any(kw in query.lower() for kw in blocked_keywords):
        return None  # Block the query entirely; handle locally

    # Detect sensitive entities
    results = analyzer.analyze(
        text=query,
        entities=["PERSON", "LOCATION", "PHONE_NUMBER",
                  "EMAIL_ADDRESS", "DATE_TIME"],
        language="en"
    )

    # Anonymize detected entities (default operator replaces with <ENTITY_TYPE>)
    sanitized = anonymizer.anonymize(text=query, analyzer_results=results)

    return sanitized.text
```
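
For example (output shape from Presidio's default replace operator, which substitutes `<ENTITY_TYPE>` tags; actual detections depend on the underlying NER model):

```python
print(sanitize_for_cloud("Tell John the electric bill doubled in January"))
# e.g. "Tell <PERSON> the electric bill doubled in <DATE_TIME>"

print(sanitize_for_cloud("Is the garage camera recording?"))
# None -- blocked keyword, so the query is handled locally
```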

### Cloud API Data Policies

**Anthropic (Claude):**
- API inputs are NOT used for training by default
- Can explicitly opt out
- Data retained 30 days for safety, then deleted

**xAI (Grok):**
- The data sharing program is opt-in ($150/month credit if you opt in)
- Can opt out and keep data private
- Standard API usage is not used for training if opted out

---

## Remote Access

### Recommended: Tailscale (Zero-Config VPN)

```bash
# Install on the LLM server and HA
curl -fsSL https://tailscale.com/install.sh | sh
tailscale up
```

- WireGuard-based mesh network
- No port forwarding needed
- Free for personal use (100 devices, 3 users)
- Access HA + LLM from anywhere, securely

### Alternative: WireGuard (Self-Hosted)

- Runs on the router or a dedicated server
- Full control, no third-party dependency
- Requires port forwarding (one UDP port)
- More setup, more control

### Home Assistant Cloud (Nabu Casa)

- $6.50/month, official HA remote access
- No VPN config needed
- Supports the HA development team
- Simplest option

---

## Security Hardening Checklist

- [ ] Disable UPnP on the router
- [ ] Enable 2FA on Home Assistant
- [ ] Strong passwords (16+ chars, random) on all services
- [ ] Regular updates: HA, Ollama, OS, router firmware
- [ ] Monitor failed login attempts in the HA logs
- [ ] Daily automated backups (HA + LLM configs)
- [ ] Firewall rules reviewed quarterly
- [ ] IoT devices on an isolated VLAN
- [ ] No camera feeds sent to cloud APIs
- [ ] PII sanitization active on cloud-bound queries

---

## Data Retention

| System | Retention | Notes |
|---|---|---|
| HA sensor data | 30 days raw, indefinite aggregated | Purge in Settings > System > Storage |
| Camera recordings | 7-14 days | Local storage only (NAS or NVMe) |
| LLM conversation logs | Purge monthly | Ollama logs stored locally |
| Cloud API logs | Disable or redact PII | Check provider settings |
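
The HA side of the first row can also be pinned in YAML; these are standard `recorder` options, with 30 days matching the table:

```yaml
# configuration.yaml
recorder:
  purge_keep_days: 30   # raw history retention
  auto_purge: true      # nightly purge of anything older
```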

---

## Sources

- https://newerest.space/mastering-network-segmentation-vlans-home-assistant-iot-security/
- https://www.xda-developers.com/vlan-rules-every-smart-home-should-have/
- https://thehomesmarthome.com/home-assistant-security-vlans-firewalls-ids/
- https://tailscale.com/

projects/wrightstown-smarthome/session-logs/2026-02-09-session.md (new file, 85 lines)
@@ -0,0 +1,85 @@

# Wrightstown Smart Home - Session Log 2026-02-09

**Session:** Initial Research and Planning
**Machine:** ACG-M-L5090

---

## Work Completed

### 1. Project Scope Defined
- Privacy-first smart home (no Google/Alexa)
- Home Assistant Yellow (already owned, not set up)
- Local LLM server for private queries
- Hybrid bridge: local + Claude API + Grok API
- Separate project from Wrightstown Solar, with planned crossover

### 2. Home Assistant Yellow Research
- Yellow still receives updates despite production ending in October 2025
- Needs a CM4 or CM5 compute module (not included)
- Built-in Zigbee 3.0 radio, M.2 NVMe slot
- Local voice: Wyoming + Whisper + Piper (all local, no cloud)
- Victron integration planned via Modbus TCP (future crossover)

### 3. Local LLM Server Research
- Recommended build: RTX 4090 24GB ($1,940-2,240)
- Software: Ollama (primary) + Open WebUI (interface)
- Models: Qwen 2.5 7B (fast voice), Llama 3.1 70B Q4 (reasoning)
- Alternative builds researched: budget ($580), flagship ($4,000+), Mac Mini M4

### 4. Hybrid Bridge Design
- LiteLLM proxy as the unified routing layer
- Routes: local (Ollama) / Claude API (reasoning) / Grok API (search)
- 80/15/5 split estimated at ~$5/month cloud costs
- Manual routing first, keyword-based later, semantic routing eventually
- Integration with HA via Extended OpenAI Conversation

### 5. Network Security Design
- 4-VLAN architecture: Trusted / Infrastructure / IoT / Guest
- IoT isolation (devices can't reach the trusted network)
- PII sanitization pipeline for cloud-bound queries
- Private data (cameras, sensors, presence) stays local only
- Remote access via Tailscale or WireGuard

---

## Decisions Made

| Decision | Chosen | Rationale |
|---|---|---|
| Smart home platform | Home Assistant | Open source, local-first, 2000+ integrations |
| Hardware | HA Yellow (owned) | Built-in Zigbee, already purchased |
| Voice assistant | Wyoming/Whisper/Piper | Fully local, no cloud dependency |
| LLM server GPU | RTX 4090 24GB (recommended) | Best price/performance for 70B models |
| LLM software | Ollama | Simplest, OpenAI-compatible API |
| Hybrid routing | LiteLLM | Unified API, cost tracking, fallbacks |
| Cloud: reasoning | Claude API (Anthropic) | Best reasoning quality |
| Cloud: search | Grok API (xAI) | Internet access, 2M context |
| Network | VLAN segmentation | IoT isolation, privacy |
| Separate from solar | Yes | Different timelines, crossover later |

---

## Files Created

- `projects/wrightstown-smarthome/PROJECT_INDEX.md`
- `projects/wrightstown-smarthome/documentation/ha-yellow-setup.md`
- `projects/wrightstown-smarthome/documentation/llm-server-build.md`
- `projects/wrightstown-smarthome/documentation/hybrid-bridge.md`
- `projects/wrightstown-smarthome/documentation/network-security.md`
- `projects/wrightstown-smarthome/session-logs/2026-02-09-session.md` (this file)

---

## Next Steps

- [ ] Check whether a CM4/CM5 module is owned or needs purchasing
- [ ] Set up HA Yellow (basic install, Zigbee, first automations)
- [ ] Research specific Zigbee devices to purchase
- [ ] Decide on the LLM server GPU budget (3060 budget vs. 4090 sweet spot)
- [ ] Purchase LLM server hardware
- [ ] Decide on VLAN hardware (TP-Link Omada vs. Ubiquiti UniFi)
- [ ] Set up Ollama + Open WebUI
- [ ] Create Anthropic API and Grok API accounts
- [ ] Configure the LiteLLM proxy
- [ ] Integrate with HA via Extended OpenAI Conversation