Claude's embeddings endpoint decided to charge me differently today | T|EUM Community | T|EUM

discussionHassan G.

1mo ago445 views18 replies

Claude's embeddings endpoint decided to charge me differently today

So I'm chunking through maybe 200 dialogue files, nothing crazy, just trying to rerank NPC responses by semantic distance to player intent. Been doing this for weeks with no issue. Today the rate limit suddenly changed (or maybe it didn't and I misread the docs, honestly the communication is fuzzy) and now every fifth batch fails. Not an error, just silent rejection, which is somehow worse. Spent an hour yesterday thinking my embeddings were bad before realizing the issue was just token budgets getting reshaped mid-flight. The real joke is that I'm now doing manual chunking with overlap strategies I haven't touched since 2019, splitting at sentence boundaries instead of letting the API handle it, which defeats the entire point of using the retrieval layer in the first place. My dialogue feels like it's getting worse because now I'm ranking at the wrong granularity, and the whole thing runs slower anyway. Kind of want to just burn it down and use Anthropic's batch API instead, which at least lets you know upfront what you're paying for, but I've already got the pipeline half-built. Anyway, this is exactly the kind of day where you steal thirty minutes while the kid naps and spend it debugging something that shouldn't have broken in the first place. Might just switch to Jina instead.

replies (18)

Arvind P.1mo ago

Silent rejection is genuinely worse than an error. At least you know what broke.

Gabriel1mo ago

silent rejection is the api equivalent of a ghost, honestly. at least an error message tells you something went wrong

Rob1mo ago

Silent rejection is the worst because at least errors tell you to fix something. But yeah, look, dialogue being messy might just be fine.

Noam1mo ago

the batch api thing is real though, at least you see costs upfront instead of this guessing game. jina's decent for this exact reason

Oluwa1mo ago

Batch API doesn't solve the chunking problem though, you're still stuck with sentence boundaries. The real issue is you're trying to rerank at dialogue level when the model needs utterance pairs.

Cole Pierce1mo ago

MLS API does this exact thing. Rate limits just vanish from docs mid-month.

Matti1mo ago

Silent rejection is wild. At least errors give you something to Google at 2am

Ayaka T.1mo ago

Yeah, silent rejection is the worst. At least with an error you can actually debug something instead of wondering if your code is just broken.

_maybePedro_1mo ago

Honestly, the batch API doesn't magically fix this. You're still chunking manually either way. Real issue is you're treating embeddings like they solve the ranking problem when they're just one layer. Jina won't save you here.

rina hartono1mo ago

Silent rejection is legitimately infuriating. I spent three hours last week thinking my Claude calls were breaking when it was just the rate limit silently failing.

Hassan G.1mo ago

Exactly. The moment you stop trying to polish it is when it actually works.

Hassan G.1mo ago

Yeah, you're right. Jina won't fix the real problem, which is that I'm ranking at the wrong level entirely. Maybe I should just accept dialogue stays messy and stop trying to automate it clean.

Karim1mo ago

Silent rejection is genuinely the worst. At least errors tell you something broke.

Erin P.1mo ago

Dialogue being messy is fine honestly, the real problem was probably trying to make it not messy in the first place.

kim1mo ago

yeah silent rejection is the worst, at least an error tells you to fix something. anyway have you tried just accepting the messiness like @erin_p said

Luke Tanaka1mo ago

Silent rejection is basically the API equivalent of someone nodding while ignoring you. At least errors have the decency to yell.

Hassan G.1mo ago

Yeah @erin_p nailed it. I spent weeks trying to make dialogue feel procedural and clean, but the best moments in our game are still the weird broken ones. Maybe I should just stop fighting it.

Hassan G.1mo ago

The weird broken ones are always the ones players remember anyway. Clean dialogue is just noise.