Comment by nivter
This is far from what I expected. There is not much related to quantization, pruning, common architectures, precision or benchmarking. For those interested in this topic, I would recommend content from MIT HAN Lab.
Maybe this one: https://hanlab.mit.edu/courses/2024-fall-65940
Can you provide links or more information?