Comment by bassp

Comment by bassp 4 days ago

1 reply

Sorry if I was sloppy with my wording, instruction issuance is what I meant :)

I thought that warps weren't issued instructions unless they were ready to execute (ie had all the data they needed to execute the next instruction), and that therefore it was a best practice, in most (not all) cases to have more threads per block than the SM can execute at once so that the warp scheduler can issue instructions to one warp while another waits on a memory read. Is that not true?

einpoklum 4 days ago

> warps weren't issued instructions unless they were ready to execute

This is true, but after they've been issued, it still takes a while for the execution to conclude.

> it was a best practice, in most (not all) cases to have more threads per block than the SM can execute at once

Just replace "most" with "some". It really depends on what kind of kernel you're writing.