Comment by Lerc
I think it would be nearly impossible to prove that your corpus had no mathematical content. In fact it would be extremely contentious as to what was considered mathematical content. Do you remove all reference to numbers and counting? How about the words and, or and not.
I think the only criteria of no math that would satisfy some people would by definition fail because the model would have no concept of points or distance because some would certainly count those as mathematical content.