Comment by winwang
Yeah, not just OpenCL, but even "newer" standards like WebGPU. I considered making a blog post where I just put the two hello worlds side-by-side and say nothing else.
I was severely disappointed after seeing people praise WebGPU (I believe for being better than OpenGL).
As for the platform-agnostic bytecode, that's where something like MLIR would work too (kind of). But we could also simply just start with transpiling that bytecode into CUDA/PTX.
Better UX with wider platform compatibility: CuPy, Triton.