Comment by bwfan123
Comment by bwfan123 a day ago
Finally some real pushback to the whole agentic mania - from an actor who is incentivized to push the narrative. Following the recent apple paper - some realism is being injected into the hype.
58% success rate on a task is close to a coin flip. and 35% success rate on multiturn. >80% success rate on workflows could make that a reasonable usecase (eg, form filling) with some human supervision.
If it were an employee it would have been fired already, unless it were a nepo hire, and in someways, it is.