Comment by lyu07282
This would also be very helpful for datasets, we need to preserve those indefinitely in order to be able to make meaningful benchmarks in the future. Some datasets will only contain references to the source material and labels, each researcher is then supposed to assemble the actual dataset from those references.