cbcba954ae
Document the context window discussion and design decisions: - Industry approaches (MoE, Ensemble, Pipeline, Speculative) - Memory offloading options and trade-offs - Why KV cache can't be shared between workers - Three architectural options for 30K-60K+ context - Current implementation status - Hardware-specific recommendations Provides reference for future enhancements and helps users understand memory constraints in swarm architectures.