AI Tools Kit
AI Tools Kit provides free developer tools for working with AI language models. Built by developers, for developers.
Learn more about us →Related Articles
How Speculative Decoding Works: Draft Models and 3x Speedup
Speculative decoding proposes token batches with a small draft model and verifies them in one large-model pass — 2-3x speedup with zero quality loss. Here's the algorithm, the acceptance math, and when it fails.
Inside Model Context Protocol: How MCP Servers Actually Work
MCP connects AI models to tools via JSON-RPC 2.0 across stdio and HTTP transports. This deep-dive covers the host-client-server split, capability negotiation, the tool call state machine, and why the protocol was designed this way.
How Vector Databases Actually Work: HNSW, ANN, and Retrieval Architecture
Vector databases are not magic. This deep-dive covers HNSW graph structure, ANN tradeoffs, index construction costs, and the retrieval pipeline behind every RAG system.