Comment by mooreds
How do you test these skills for consistency over time, or is that not needed?
My experience has been that if the skill is broken down into a function, possibly paired with a validator in another stage, the output is about 99.9% deterministic.
I have not yet tested this at scale, but give me six months.
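For illustration, here is a minimal Python sketch of the function-plus-validator idea, with a rough way to measure consistency across repeated runs. All names (`extract_total`, `is_valid_total`, `consistency_rate`) are hypothetical, and the skill is a deterministic stand-in, not a real model call; it is one possible shape, not a tested setup.

```python
from collections import Counter
from typing import Callable


def extract_total(text: str) -> str:
    """Hypothetical stand-in for a skill; a real one would call a model."""
    for token in text.split():
        if token.startswith("$"):
            return token
    return ""


def is_valid_total(output: str) -> bool:
    """Validator stage: accept only outputs shaped like $12.34."""
    if not output.startswith("$"):
        return False
    whole, dot, cents = output[1:].partition(".")
    return whole.isdigit() and dot == "." and len(cents) == 2 and cents.isdigit()


def consistency_rate(
    skill: Callable[[str], str],
    validator: Callable[[str], bool],
    prompt: str,
    runs: int = 20,
) -> float:
    """Fraction of runs that pass validation and agree with the most common valid output."""
    outputs = [skill(prompt) for _ in range(runs)]
    valid = [o for o in outputs if validator(o)]
    if not valid:
        return 0.0
    most_common_count = Counter(valid).most_common(1)[0][1]
    return most_common_count / runs
```

Running `consistency_rate(extract_total, is_valid_total, "Invoice total: $12.34 due Friday")` on a schedule and tracking the number over time would be one way to notice drift; with a real model behind the skill, the rate would presumably fall below 1.0.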
The same way you'd test a human following written instructions over time.
Check the results.