So I'm chunking through maybe 200 dialogue files, nothing crazy, just trying to rerank NPC responses by semantic distance to player intent. Been doing this for weeks with no issue. Today the rate limit suddenly changed (or maybe it didn't and I misread the docs, honestly the communication is fuzzy) and now every fifth batch fails. Not an error, just silent rejection, which is somehow worse. Spent an hour yesterday thinking my embeddings were bad before realizing the issue was just token budgets getting reshaped mid-flight. The real joke is that I'm now doing manual chunking with overlap strategies I haven't touched since 2019, splitting at sentence boundaries instead of letting the API handle it, which defeats the entire point of using the retrieval layer in the first place. My dialogue feels like it's getting worse because now I'm ranking at the wrong granularity, and the whole thing runs slower anyway. Kind of want to just burn it down and use Anthropic's batch API instead, which at least lets you know upfront what you're paying for, but I've already got the pipeline half-built. Anyway, this is exactly the kind of day where you steal thirty minutes while the kid naps and spend it debugging something that shouldn't have broken in the first place. Might just switch to Jina instead.