Comment by exi1up

Comment by exi1up 6 days ago

1 reply

I could be missing something, but is there some sort of metric for these comparisons to other software? Like the BLEU score which I've seen in studies relating to comparing LLMs to Google Translate. I find it difficult to believe it is better than DeepL in a vacuum.

Falimonda 6 days ago

+1

I'm also interested in the benchmarks they've use, if any.