Comment by IncreasePosts 16 hours ago
Sure - I guess what I was asking is how to make sure everything is okay in the unstructured -> structured conversion.
"My name is John and I'm 40 years old" -> {name:"John", age:40}
How can you gain confidence that the AI doesn't spit out {name:"John", age:41}?
The only thing I currently do is run a massive test suite to gain some statistical confidence that it works, but I worry about situations like a person having a rare Unicode character in their name (not to mention people intentionally trying to trick the system).
Don't have the AI do the data parsing. Have the AI write a parser and have the parser do the parsing. Think about how a person would parse vast amounts of data: they write a parser to do it. The devil is, of course, in the details.
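To make that concrete, here is a minimal sketch of the kind of parser an AI (or a person) might write for the example sentence above. The names `parse_person` and `_PATTERN` are made up for illustration, and the pattern only covers that one sentence shape; real input would need many more patterns. The useful property is that it is deterministic: it either copies the name and age out of the text or returns `None`, so it can't turn 40 into 41.

```python
import re
import unicodedata
from typing import Optional, TypedDict


class Person(TypedDict):
    name: str
    age: int


# Hypothetical pattern covering a single sentence shape.
# [^\W\d_] matches any Unicode letter, so non-ASCII names are accepted.
_PATTERN = re.compile(
    r"My name is (?P<name>[^\W\d_]+(?:[ '\-][^\W\d_]+)*)"
    r" and I['\u2019]m (?P<age>[0-9]{1,3}) years old\.?"
)


def parse_person(text: str) -> Optional[Person]:
    """Return the extracted fields, or None if the text doesn't fit the pattern."""
    # Normalize to NFC so composed and decomposed forms of the same name match alike.
    normalized = unicodedata.normalize("NFC", text.strip())
    m = _PATTERN.fullmatch(normalized)
    if m is None:
        return None  # refuse instead of guessing
    return {"name": m.group("name"), "age": int(m.group("age"))}


print(parse_person("My name is John and I'm 40 years old"))
print(parse_person("I'm John, forty-ish"))
```

Anything that comes back as `None` can go to a fallback (a human, another pattern, or an AI pass whose output is checked against the source text) rather than being silently guessed.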