Claude Code does not have to mean cloud costs anymore.
In this blog, Manjunath Janardhan walks us through how to run Claude Code locally on Apple Silicon using LM Studio + LiteLLM + Qwen3-Coder-30B — with zero API cost and no cloud dependency.
The setup uses:
LM Studio for local inference
Qwen3-Coder-30B as the coding model
LiteLLM as a bridge between Claude Code and a local OpenAI-compatible endpoint
This is a practical way to get the Claude Code experience fully offline on macOS, especially for M-series Macs where MLX models can make a real difference.
A solid guide for developers who care about:
Local-first workflows
Privacy
Reducing API spend
Getting more from Apple Silicon
Worth reading if you want to run agentic coding tools on your own machine without relying on the cloud.