Comment by data_maan
"Open source" lol
Open-weight. As usual, you don't get the dataset, training scripts, etc.
"Open source" lol
Open-weight. As usual, you don't get the dataset, training scripts, etc.
This seems significantly more permissive than GPL. I think it's reasonable to consider it open-weight.
So "MIT with attribution" (but only for huge commercial use cases making tons of money off the product) is not open-weight? Do you consider CC BY photos on Wikipedia to be Image Available or GPL licensed software to be code-available too?
Tangent: I don't understand the contingent that gets upset about open LLMs not shipping with their full training regimes or source data. The software a company spent hundreds of millions of dollars creating, which you are now free to use and distribute with essentially no restrictions, is open source. It has weights in it, and a bunch of related software for actually running a model with those weights. How dare they!
4-clause BSD is considered open source by Debian and the FSF and has a similar requirement.
Wont happen under the current copyright regime, it is impossible to train SOTA without copyrighted text, how do you propose distributing that?
It's not even open-weight. It's weight-available. It uses a "modified MIT license":