Comment by measurablefunc 5 hours ago

3 replies

You're just moving the goalposts & not addressing the question I asked. Why isn't AI optimizing the kernels in its own code the way people have been optimizing them, as in the posted paper?

phkahler 5 hours ago

It will, right after it reads the paper.

  • measurablefunc 5 hours ago

    I read the paper. All the prerequisites are already available in existing literature & the authors basically profiled & optimized around the bottlenecks to avoid pipeline stalls, using instructions that utilize the available tensor & CUDA cores. Seems like something these super duper AIs that don't get tired should be able to do pretty easily.