Comment by lode

Comment by lode 2 days ago

43 replies

I tried it out yesterday, after reading the enthousiastic article at https://www.macstories.net/stories/clawdbot-showed-me-what-t...

Setting it up was easy enough, but just as I was about to start linking it to some test accounts, I noticed I already had blown through about $5 of Claude tokens in half an hour, and deleted the VPS immediately.

Then today I saw this follow up: https://mastodon.macstories.net/@viticci/115968901926545907 - the author blew through $560 of tokens in a weekend of playing with it.

If you want to run this full time to organise your mailbox and your agenda, it's probably cheaper to hire a real human personal assistant.

quietsegfault 2 days ago

Just watch a few videos on Clawdbot. You'll invariably see some influencer's Anthropic key, and just use that. Wokka wokka!

0xbadcafebee 2 days ago

If you have an old M1 Macbook lying around, you use that to run a local model. Then it only costs whatever the electricity costs. May not be a frontier model, but local models are insanely good now compared to before. Some people are buying Mac Minis for this, but there's many kinds of old/cheap hardware that works. An old 1U/2U server some company's throwing out with a tech refresh, lots of old RAM, an old GPU off eBay, is pretty perfect. MacBook M1 Max or Mac Mini w/64GB RAM is much quieter, power efficient, compact. But even my ThinkPad T14s runs local models. Then you can start optimizing inference settings and get it to run nearly 2x faster.

(keep in mind with the cost savings: do an initial calculation of your cloud cost first with a low-cost cloud model, not the default ones, and then multiply times 1-2 years, compare that cost to the cost of a local machine + power bill. don't just buy hardware because you think it's cheaper; cloud models are generally cost effective)

  • muwtyhg 2 days ago

    > don't just buy hardware because you think it's cheaper

    Surely there is also the benefit of data privacy and not having a private company creating yet another ad profile of me to sell later on?

wartywhoa23 2 days ago

Huge pyramids are built of relatively small blocks, kudos to everyone contributed.

  • Sharlin 2 days ago

    "Pyramid" is an interesting metaphor to use, given the connotations.

    • pohl 2 days ago

      Are you alluding to pyramid schemes or “Look on my Works, ye Mighty, and despair”?

      • Sharlin 2 days ago

        I was thinking of the former, but the latter could certainly apply too.

turnsout 2 days ago

Yeah, I looked at Clawdbot / OpenClaw at the beginning of the week (Monday), but the token use scared me off.

But I was inspired to use Claude Code to create my own personal assistant. It was shocking to see CC bang out an MVP in one Plan execution. I've been iterating it all week, but I've had it be careful with token usage. It defaults to Haiku (more than enough for things like email categorization), properly uses prompt caching, and has a focused set of tools to avoid bloating the context window. The cost is under $1 per check-in, which I'm okay with.

Now I get a morning and afternoon check-in about outstanding items, and my Inbox is clear. I can see this changing my relationship to email completely.

  • azinman2 2 days ago

    Post it!

    • turnsout 2 days ago

      A lot of the system prompt, skills and tools center around my specific needs (I manage separate IMAP and Gmail inboxes, use Granola, and have iCloud calendars). And there are some hard assumptions baked in (I want to have a morning & afternoon check-in). It probably wouldn't be useful as-is, but maybe as inspiration?

      • browningstreet 2 days ago

        I'd love to see even a filtered version of it. I've been doing very similar things with an "everything" database. That's been my own personal northstar.

        BTW, OpenCode has free Kimi (I haven't hit a quota yet) right now and it's done pretty great things for me in the last 24 hours.

      • RickS 2 days ago

        If it was oneshotted, I'd be curious to see the prompt

        • turnsout 2 days ago

          I wouldn't say it was oneshotted, but it did produce a working MVP in one Plan execution. Meaning, I went back & forth a few times about requirements, it built a plan, and then CC spent just under 15 minutes writing the code. Once I got the credentials plugged in, the core integrations (Slack, gmail, IMAP, iCloud calendar) and agent loop did work. I can share the initial message if you're curious.

columk a day ago

That's the sad thing. There are so many millions of talented under-employed people in the world that would gladly run errands or set up automations for you for $200-$1000 per month or whatever people are spending on this bot.

Developers trust lobsters more than humans.

The other wild thing is that many of these expensive automations that are being celebrated on X can already be done by voice using Siri, Google, or any MCP client.

geek_slop 2 days ago

I had the same problem. Ask Clawdbot to optimize token usage. It cut my usage in half.

ern_ave 2 days ago

Can't you just point it at a local ollama? It'd be slower, but free (except for your electricity bill).

itissid 2 days ago

I think one thing these things could benefit from is an optimization algorithm that creates prompts based on various costs. $$, and what prompts actually gives good results. But it's not an optimization algorithm in the sense gradient descent is, but more like Bandits and RL.

There has been some work around this practically being tried out using it for structured data outputs from LLMs https://docs.boundaryml.com/guide/baml-advanced/prompt-optim...

I won't claim I understand its implementation very well but it seems like the only approach to have a GOFAI style thing where the agent can ask for human help if it blows through a budget

lurking_swe 2 days ago

part of me sympathizes, but part of me also rolls my eyes. Am i the only one that’s configuring limits on spend and also alerts? Takes 2 seconds to configure a “project” in OpenAI or Claude and to scope an api key appropriately.

Not doing so feels like asking for trouble.

  • lode 2 days ago

    That's what I did, which is why I abandoned my experiment this quickly.

    I'd find it hard to write such an article about how this is the next best thing since sliced bread without mentioning it spending so much money.

    • lurking_swe 2 days ago

      good on you! The anecdote of that person spending hundreds of dollar is scary.

  • jmathai 2 days ago

    Are you all enabling auto reload for personal projects?

    I load $20 at a time and wait for it to break and add more.

    • fnordlord 2 days ago

      Can you get meaningful work done with CC at $20 at a time? I load $20 at a time onto the API for general chatting purposes and it lasts a few months at a time. I've always avoided trying CC because I got the impression people were burning $100+/mo, which is beyond my personal hobby budget.

      • sanarothe 2 days ago

        /Not a software engineer perspective working on side projects

        I guess if you're letting it vibe code huge chunks. I'm doing mostly handwritten code for my current project with a little bit of "I don't want to deal with this, Claude can handle it" and I've spent $1.26 this month for my 446 lines of code.

        But yes I suppose at that rate, if Gastown or Beads or whatever is 300,000 lines of code (just to use a project known to be fully vibe coded with rough LOC reported), that would be over $800.

        Don't let it vibe code hundreds of thousands of lines of code I guess.

      • TheGRS 2 days ago

        I was doing that initially, but I think the subscriptions are generally worth it for personal projects. $20/mo is good if you're like me and you can do this stuff maybe a couple nights a week, I haven't run into the limitations on that yet. The $100+ subscriptions are needed if you're doing it every day. YMMV

      • quietsegfault 2 days ago

        I'm successful with personal projects (reverse engineering USB devices, sledding spot finder, silly stuff) on the $20/mo Claude plan. I rarely use Opus except for planning larger things.

      • browningstreet 2 days ago

        I keep a master llm.md file and rotate between Claude Code (Pro), Antigravity Opus, Antigravity Flash, and OpenCode Kimi. I don't actually mind hitting limits.. though I'm least happy when Opus goes away.

        My entire process is to build a generic llm.md file that all the tools can use and record to. I don't want to be tied completely to any one solution. You can get pretty far without spending a lot on tokens. I can run almost continually, and presently I'm the bottleneck anyway.

      • jmathai 2 days ago

        For Claude Code, I now pay the $20/mo subscription for pro because I was spending more using it via API credits.

        Even if I had to reload manually very often, I still would not enable auto reload. These APIs are crazy expensive and I'm not looking for a surprise bill.

  • iamtheworstdev 2 days ago

    not only that, but clawdbot/moltbot/openclaw/whatever they call themselves tomorrow/etc also tells you your token usage and how much you have left on your plan while you're using it (in the terminal/console). So this is pretty easily tracked...

jauntywundrkind 2 days ago

Would have been $68 on DeepSeek, which is also imho very good.

I still have Opus review the shit out of & plan my work. But it doesn't need to be hands on keyboard doing the work.

guluarte 2 days ago

you can use your claude max subscription

  • swordsith 2 days ago

    oh yeah let me just pull my 200$ monthly subscription out of my back pocket

    • guluarte 2 days ago

      yeah it is only worth it if you are already paying otherwise it is not

  • preommr 2 days ago

    Isn't that explictly against the TOS? I feel like Anthropic brought out the ban hammer a few days ago for things like opencode because it wasn't using the apis but the max subscriptions that are pretty much only allowed through things like claude code.