Yes, but how cheap is it to run four at the same time? It’s tough to run one goo... | Hacker News


EduardoBautista 45 days ago | on: Computer Use is 45x more expensive than structured...

Yes, but how cheap is it to run four at the same time? It’s tough to run one good model locally, but running four at the same time, which I commonly do with Claude and Codex, just doesn’t seem to be happening anytime soon.

Aurornis 45 days ago

I'm referring to hosted models, such as those available via OpenRouter or from the model providers' own services.

I think everyone claiming that inference is getting more expensive is unaware that there are more LLM providers than Google, Anthropic, and OpenAI.
