This week Zepto published a 15-minute engineering deep-dive called “Building Search for a 10-Minute World.” It is genuinely good architecture writing. Llama 3-8B for query correction, a custom bi-encoder for semantic retrieval, Mixture of Experts ranking, a four-stream exploit/explore recall design. The kind of post that gets bookmarked. Then I opened zepto.com without logging in… Continue reading Zepto’s LLM Search Works Great. Unless You’re a New User.