feat: Major directory reorganization and cleanup

Reorganized project structure for better maintainability and reduced disk usage by 95.9% (11 GB -> 451 MB). Directory Reorganization (85% reduction in root files): - Created docs/ with subdirectories (deployment, testing, database, etc.) - Created infrastructure/vpn-configs/ for VPN scripts - Moved 90+ files from root to organized locations - Archived obsolete documentation (context system, offline mode, zombie debugging) - Moved all test files to tests/ directory - Root directory: 119 files -> 18 files Disk Cleanup (10.55 GB recovered): - Deleted Rust build artifacts: 9.6 GB (target/ directories) - Deleted Python virtual environments: 161 MB (venv/ directories) - Deleted Python cache: 50 KB (__pycache__/) New Structure: - docs/ - All documentation organized by category - docs/archives/ - Obsolete but preserved documentation - infrastructure/ - VPN configs and SSH setup - tests/ - All test files consolidated - logs/ - Ready for future logs Benefits: - Cleaner root directory (18 vs 119 files) - Logical organization of documentation - 95.9% disk space reduction - Faster navigation and discovery - Better portability (build artifacts excluded) Build artifacts can be regenerated: - Rust: cargo build --release (5-15 min per project) - Python: pip install -r requirements.txt (2-3 min) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-18 20:42:28 -07:00
parent 89e5118306
commit 06f7617718
96 changed files with 54 additions and 2639 deletions
--- a/docs/database/BULK_IMPORT_RESULTS.md
+++ b/docs/database/BULK_IMPORT_RESULTS.md
@@ -0,0 +1,276 @@
+# Claude Conversation Bulk Import Results
+
+**Date:** 2026-01-16
+**Import Location:** `C:\Users\MikeSwanson\.claude\projects`
+**Database:** ClaudeTools @ 172.16.3.20:3306
+
+---
+
+## Import Summary
+
+### Files Scanned
+- **Total Files Found:** 714 conversation files (.jsonl)
+- **Successfully Processed:** 65 files
+- **Contexts Created:** 68 contexts (3 duplicates from ClaudeTools-only import)
+- **Errors/Empty Files:** 649 files (mostly empty or invalid conversation files)
+- **Success Rate:** 9.1% (65/714)
+
+### Why So Many Errors?
+Most of the 649 "errors" were actually empty conversation files or subagent files with no messages. This is normal for Claude projects - many conversation files are created but not all contain actual conversation content.
+
+---
+
+## Context Breakdown
+
+### By Context Type
+| Type | Count | Description |
+|------|-------|-------------|
+| `general_context` | 37 | General conversations and interactions |
+| `project_state` | 26 | Project-specific development work |
+| `session_summary` | 5 | Work session summaries |
+
+### By Relevance Score
+| Score Range | Count | Quality |
+|-------------|-------|---------|
+| 8-10 | 3 | Excellent - Highly relevant technical contexts |
+| 6-8 | 18 | Good - Useful project and development work |
+| 4-6 | 8 | Fair - Some useful information |
+| 2-4 | 26 | Low - General conversations |
+| 0-2 | 13 | Minimal - Very brief interactions |
+
+### Top 5 Highest Quality Contexts
+
+1. **Conversation: api/models/__init__.py**
+   - Score: 10.0/10.0
+   - Type: project_state
+   - Messages: 16
+   - Duration: 38,069 seconds (~10.6 hours)
+   - Tags: development, fastapi, sqlalchemy, alembic, docker, nginx, python, javascript, typescript, api, database, auth, security, testing, deployment, crud, error-handling, validation, optimization, refactor
+   - Key Decisions: SQL syntax for incident_type, severity, status enums
+
+2. **Conversation: Unknown**
+   - Score: 8.0/10.0
+   - Type: project_state
+   - Messages: 78
+   - Duration: 229,154 seconds (~63.7 hours)
+   - Tags: development, postgresql, sqlalchemy, python, javascript, typescript, api, database, auth, security, testing, deployment, crud, error-handling, optimization, critical, blocker, bug, feature, architecture
+
+3. **Conversation: base_events.py**
+   - Score: 7.6/10.0
+   - Type: project_state
+   - Messages: 13
+   - Duration: 34,753 seconds (~9.7 hours)
+   - Tags: development, fastapi, alembic, python, typescript, api, database, testing, async, crud, error-handling, bug, feature, integration
+
+---
+
+## Tag Distribution
+
+### Most Common Tags
+Based on the imported contexts, the following tags appear most frequently:
+
+**Development:**
+- `development` (appears in most project_state contexts)
+- `api`, `crud`, `error-handling`
+- `testing`, `deployment`, `integration`
+
+**Technologies:**
+- `python`, `typescript`, `javascript`
+- `fastapi`, `sqlalchemy`, `alembic`
+- `docker`, `postgresql`, `database`
+
+**Security & Auth:**
+- `auth`, `security`
+
+**Work Types:**
+- `bug`, `feature`
+- `optimization`, `refactor`, `validation`
+
+**MSP-Specific:**
+- `msp` (5 contexts tagged with MSP work)
+
+---
+
+## Verification Tests
+
+### Context Recall Tests
+
+**Test 1: FastAPI + SQLAlchemy contexts**
+```bash
+GET /api/conversation-contexts/recall?tags=fastapi&tags=sqlalchemy&limit=3&min_relevance_score=6.0
+```
+**Result:** Successfully recalled 3 contexts
+
+**Test 2: MSP-related contexts**
+```bash
+GET /api/conversation-contexts/recall?tags=msp&limit=5
+```
+**Result:** Successfully recalled 5 contexts
+
+**Test 3: High-relevance contexts**
+```bash
+GET /api/conversation-contexts?min_relevance_score=8.0
+```
+**Result:** Retrieved 3 high-quality contexts (scores 8.0-10.0)
+
+---
+
+## Import Process
+
+### Step 1: Preview
+```bash
+python test_import_preview.py "C:\Users\MikeSwanson\.claude\projects"
+```
+- Found 714 conversation files
+- Category breakdown: 20 files shown as samples
+
+### Step 2: Dry Run
+```bash
+python scripts/import-claude-context.py --folder "C:\Users\MikeSwanson\.claude\projects" --dry-run
+```
+- Scanned 714 files
+- Would process 65 successfully
+- Would create 65 contexts
+- Encountered 649 errors (empty files)
+
+### Step 3: ClaudeTools Project Import (First Pass)
+```bash
+python scripts/import-claude-context.py --folder "C:\Users\MikeSwanson\.claude\projects\D--ClaudeTools" --execute
+```
+- Scanned 70 files
+- Processed 3 successfully
+- Created 3 contexts
+- 67 errors (empty subagent files)
+
+### Step 4: Full Import (All Projects)
+```bash
+python scripts/import-claude-context.py --folder "C:\Users\MikeSwanson\.claude\projects" --execute
+```
+- Scanned 714 files
+- Processed 65 successfully
+- Created 65 contexts (includes the 3 from ClaudeTools)
+- 649 errors (empty files)
+
+**Note:** Total contexts in database = 68 (3 from first import + 65 from full import, with 3 duplicates)
+
+---
+
+## Database Status
+
+### Connection Details
+- **Host:** 172.16.3.20:3306
+- **Database:** claudetools
+- **Total Contexts:** 68
+- **API Endpoint:** http://localhost:8000/api/conversation-contexts
+
+### JWT Authentication
+- **Token Location:** `.claude/context-recall-config.env`
+- **Token Expiration:** 2026-02-16 (30 days)
+- **Scopes:** admin, import
+
+---
+
+## Context Quality Analysis
+
+### Excellent Contexts (8-10 score)
+These 3 contexts represent substantial development work:
+- Deep technical discussions
+- Multiple hours of focused work
+- Rich tag sets (15-20 tags each)
+- Key architectural decisions documented
+
+### Good Contexts (6-8 score)
+18 contexts with solid development content:
+- Project-specific work
+- API development
+- Database design
+- Testing and deployment
+
+### Fair to Low Contexts (0-6 score)
+47 contexts with general content:
+- Brief interactions
+- Simple CRUD operations
+- Quick questions/answers
+- Less technical depth
+
+---
+
+## Next Steps
+
+### Using Context Recall
+
+**1. Automatic Recall (via hooks)**
+The system will automatically recall relevant contexts based on:
+- Current project directory
+- Keywords in your prompt
+- Active conversation tags
+
+**2. Manual Recall**
+Query specific contexts:
+```bash
+curl -H "Authorization: Bearer $JWT_TOKEN" \
+  "http://localhost:8000/api/conversation-contexts/recall?tags=fastapi&tags=database&limit=5"
+```
+
+**3. Browse All Contexts**
+```bash
+curl -H "Authorization: Bearer $JWT_TOKEN" \
+  "http://localhost:8000/api/conversation-contexts?limit=100"
+```
+
+### Improving Context Quality
+
+For future conversations to be imported with higher quality:
+1. Use descriptive project names
+2. Work on focused topics per conversation
+3. Document key decisions explicitly
+4. Use consistent terminology (tags will be auto-extracted)
+5. Longer conversations generally receive higher relevance scores
+
+---
+
+## Files Created
+
+1. **D:\ClaudeTools\test_import_preview.py** - Preview tool
+2. **D:\ClaudeTools\scripts\import-claude-context.py** - Import script
+3. **D:\ClaudeTools\analyze_import.py** - Analysis tool
+4. **D:\ClaudeTools\BULK_IMPORT_RESULTS.md** - This summary document
+
+---
+
+## Troubleshooting
+
+### If contexts aren't being recalled:
+1. Check API is running: `http://localhost:8000/api/health`
+2. Verify JWT token: `cat .claude/context-recall-config.env`
+3. Test recall endpoint manually (see examples above)
+4. Check hook permissions: `.claude/hooks/user-prompt-submit`
+
+### If you want to re-import:
+```bash
+# Delete existing contexts (if needed)
+# Then re-run import with --execute flag
+python scripts/import-claude-context.py --folder "path" --execute
+```
+
+---
+
+## Success Metrics
+
+✅ **68 contexts successfully imported**
+✅ **3 excellent-quality contexts** (score 8-10)
+✅ **21 good-quality contexts** (score 6-10 total)
+✅ **Context recall API working** (tested with multiple tag queries)
+✅ **JWT authentication functioning** (token valid for 30 days)
+✅ **All context types represented** (general, project_state, session_summary)
+✅ **Rich tag distribution** (30+ unique technical tags)
+
+---
+
+**Import Status:** ✅ COMPLETE
+**System Status:** ✅ OPERATIONAL
+**Context Recall:** ✅ READY FOR USE
+
+---
+
+**Last Updated:** 2026-01-16 03:48 UTC