Comment by mitthrowaway2

Comment by mitthrowaway2 3 months ago

27 replies

What's an example of an intellectual task that you don't think AI will be capable of by 2027?

jdauriemma 3 months ago

Being accountable for telling the truth

myhf 3 months ago

accountability sinks are all you need

kubb 3 months ago

It won't be able to write a compelling novel, or build a software system solving a real-world problem, or operate heavy machinery, create a sprite sheet or 3d models, design a building or teach.

Long-term planning and execution, and operating in the physical world, are not within reach. Slight variations of known problems should be possible (as long as the size of the solution is small enough).

lumenwrites 3 months ago

I'm pretty sure you're wrong for at least 2 of those:
For 3D models, check out blender-mcp:
https://old.reddit.com/r/singularity/comments/1joaowb/claude...
https://old.reddit.com/r/aiwars/comments/1jbsn86/claude_crea...
Also this:
https://old.reddit.com/r/StableDiffusion/comments/1hejglg/tr...
For teaching, I'm using it to learn about tech I'm unfamiliar with every day, it's one of the things it's the most amazing at.
For the things where the tolerance for mistakes is extremely low and the things where human oversight is extremely important, you might be right. It won't have to be perfect (just better than an average human) for that to happen, but I'm not sure if it will.

- kubb 3 months ago
  
  Just think about the delta between what the LLM does and what a human does, or why the LLM can’t replace the human, e.g. in a game studio.
  If it can replace a teacher or an artist in 2027, you’re right and I’m wrong.
  
  esafak 3 months ago
  
  It's already replacing artists; that's why they're up in arms. People don't need stock photographers or graphic designers as much as they used to.
  https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4602944
  
pixl97 3 months ago

> or operate heavy machinery
What exactly do you mean by this one?
In large mining operations we already have human-assisted, teleoperated AI equipment. I was watching one recently where the human got 5 or so push dozers lined up with an (admittedly simple) task of cutting a hill down, and then just got them back in line if they ran into anything outside of their training. The push and backup operations, along with blade control, were done by the AI/dozer itself.
Now, this isn't long-term planning, but it is operating in the real world.

- kubb 3 months ago
  
  Operating an excavator when building a stretch of road. Won’t happen by 2027.
  
programd 3 months ago

Does a fighter jet count as "heavy machinery"?
https://apnews.com/article/artificial-intelligence-fighter-j...

- kubb 3 months ago
  
  Yes, when they send unmanned jets to combat.
  
  Philpax 3 months ago
  
  It's already starting with the drones: https://www.csis.org/analysis/ukraines-future-vision-and-cur...
  
coolThingsFirst 3 months ago

programming

lumenwrites 3 months ago

Why would it get 60-80% as good as human programmers (which is what the current state of things feels like to me, as a programmer, using these tools for hours every day), but stop there?

- burningion 3 months ago
  
  So I think there's an assumption you've made here, that the models are currently "60-80% as good as human programmers".
  If you look at code being generated by non-programmers (where you would expect to see these results!), you don't see output that is 60-80% of the output of domain experts (programmers) steering the models.
  I think we're extremely imprecise when we communicate in natural language, and this is part of the discrepancy between belief systems.
  Will an LLM read a person's mind about what they want to build better than they can communicate it?
  That's already what recommender systems (like the TikTok algorithm) do.
  But will LLMs be able to orchestrate and fill in the blanks of imprecision in our requests on their own, or will they need human steering?
  I think that's where there's a gap in (basically) belief systems of the future.
  If we truly get post human-level intelligence everywhere, there is no amount of "preparing" or "working with" the LLMs ahead of time that will save you from being rendered economically useless.
  This is mostly a question about how long the moat of human judgement lasts. I think there's an opportunity to work together to make things better than before, using these LLMs as tools that work _with_ us.
  
- kody 3 months ago
  
  It's 60-80% as good as Stack Overflow copy-pasting programmers, sure, but those programmers were already providing questionable value.
  It's nowhere near as good as someone actually building and maintaining systems. It's barely able to vomit out an MVP and it's almost never capable of making a meaningful change to that MVP.
  If your experiences have been different that's fine, but in my day job I am spending more and more time just fixing crappy LLM code produced and merged by STAFF engineers. I really don't see that changing any time soon.
  
  lumenwrites 3 months ago
  
  I'm pretty good at what I do, at least according to myself and the people I work with, and I'm comparing its capabilities (the latest version of Claude used as an agent inside Cursor) to myself. It can't fully do things on its own and makes mistakes, but it can do a lot.
  But suppose you're right, it's 60% as good as "stackoverflow copy-pasting programmers". Isn't that a pretty insanely impressive milestone to just dismiss?
  And why would it just get to this point, and then stop? Like, we can all see AIs continuously beating the benchmarks, and the progress feels very fast in terms of experience of using it as a user.
  I'd need to hear a pretty compelling argument to believe that it'll suddenly stop, something more compelling than "well, it's not very good yet, therefore it won't be any better", or "Sam Altman is lying to us because incentives".
  Sure, it can slow down somewhat because of the exponentially increasing compute costs, but that's assuming no more algorithmic progress, no more compute progress, and no more increases in the capital that flows into this field (I find that hard to believe).
  
- coolThingsFirst 3 months ago
  
  Try this, launch Cursor.
  Type: print all prime numbers which are divisible by 3 up to 1M
  The result is that it will do a sieve. There's no need for this: the only prime divisible by 3 is 3 itself.
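
To make the shortcut concrete, here is a minimal Python sketch, for illustration only. The function names are hypothetical and are not taken from any tool mentioned in the thread. Any multiple of 3 larger than 3 has 3 as a proper divisor, so no sieve is needed; the brute-force helper exists only to cross-check the shortcut on a small range.

```python
def primes_divisible_by_3(limit: int) -> list[int]:
    """Return every prime p <= limit with p % 3 == 0.

    Any multiple of 3 larger than 3 has 3 as a proper divisor,
    so the only possible answer is 3 itself.
    """
    return [3] if limit >= 3 else []


def brute_force(limit: int) -> list[int]:
    """Slow cross-check: trial-divide every multiple of 3 up to limit."""
    found = []
    for n in range(3, limit + 1, 3):
        # n is prime if no d in [2, sqrt(n)] divides it
        if all(n % d for d in range(2, int(n ** 0.5) + 1)):
            found.append(n)
    return found


print(primes_divisible_by_3(1_000_000))
```

The shortcut runs in constant time regardless of the limit, which is the gap the prompt is meant to probe: whether the model notices the structure of the question before reaching for a general-purpose algorithm.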
  
  mysfi 3 months ago
  
  Just tried this with Gemini 2.5 Pro. It got it right, with a meaningful thought process.
  
- boringg 3 months ago
  
  Because we still haven't figured out fusion, but it's been promised for decades. Why would everything that's been promised by people with highly vested interests pan out any differently?
  One is inherently a more challenging physics problem.
  
mitthrowaway2 3 months ago

Can you phrase this in a concrete way, so that in 2027 we can all agree whether it's true or false, rather than circling a "no true Scotsman" argument?

- abecedarius 3 months ago
  
  Good question. I tried to phrase a concrete-enough prediction 3.5 years ago, for 5 years out at the time: https://news.ycombinator.com/item?id=29020401
  It was surpassed around the beginning of this year, so you'll need to come up with a new one for 2027. Note that the other opinions in that older HN thread almost all expected less.
  