Comment by MarceColl
It's tricky I think, I think it comes down to specialized tools for different systems. I'm building a tool for Japanese in this direction. But of course it doesn't generalize to everything since the content and context extraction is very objective dependent.
I agree, it is tricky. But I believe it is possible, see my response in this thread to the user allenu as a middle ground.