I'm experimenting with different techniques for constructing a kd-tree on a GPU. I'm currently using dynamic parallelism to have the GPU calculate an entire level of the kd-tree at once (or rather, streaming every position in a given level at the same time).
Since the values to be placed in the struct are neither calculated nor provided by the host, is there a way to declare the layout of a struct on the GPU and then simply define an array of that struct type inside the kernel that uses it?
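To make the question concrete, here is a minimal sketch of the kind of thing I have in mind. Everything in it is an assumption for illustration: the `KdNode` fields, the kernel names `buildTree`, `fillLevel` and `releaseTree`, the implicit heap-style indexing of children, and the placeholder split values are all made up. It also assumes the array can come from in-kernel `malloc` on the device heap, and that the code is compiled with relocatable device code (for example `nvcc -rdc=true -arch=sm_35` or newer) so that kernels can launch kernels.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical node layout. Declared once at file scope, so device code
// can use the type without the host ever creating an instance of it.
struct KdNode {
    float split;  // splitting value (placeholder in this sketch)
    int   axis;   // splitting dimension
    int   left;   // index of left child, -1 if none
    int   right;  // index of right child, -1 if none
};

// Pointer to the device-heap array, kept so a later kernel can free it.
__device__ KdNode* g_nodes = nullptr;

// Child kernel: one thread per node of a single tree level.
__global__ void fillLevel(KdNode* nodes, int first, int count,
                          int depth, int total)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= count) return;

    int self = first + i;
    int l = 2 * self + 1;
    int r = 2 * self + 2;

    KdNode n;
    n.split = 0.0f;       // a real build would compute a median here
    n.axis  = depth % 3;  // assumes 3-dimensional points
    n.left  = (l < total) ? l : -1;
    n.right = (r < total) ? r : -1;
    nodes[self] = n;
}

// Parent kernel: allocates the array on the device heap and launches
// one child grid per level. Only one thread does the work.
__global__ void buildTree(int levels)
{
    if (blockIdx.x != 0 || threadIdx.x != 0) return;

    int total = (1 << levels) - 1;
    KdNode* nodes = static_cast<KdNode*>(malloc(total * sizeof(KdNode)));
    if (nodes == nullptr) return;  // device heap exhausted
    g_nodes = nodes;

    for (int d = 0; d < levels; ++d) {
        int count = 1 << d;  // number of nodes on level d
        fillLevel<<<(count + 127) / 128, 128>>>(nodes, count - 1, count,
                                                d, total);
    }
}

// Frees the device-heap array once the build has finished.
__global__ void releaseTree()
{
    free(g_nodes);
    g_nodes = nullptr;
}

int main()
{
    buildTree<<<1, 1>>>(4);
    cudaDeviceSynchronize();  // parent and all child grids are done here

    releaseTree<<<1, 1>>>();
    cudaError_t err = cudaDeviceSynchronize();
    printf("status: %s\n", cudaGetErrorString(err));
    return err == cudaSuccess ? 0 : 1;
}
```

In this sketch the host never touches a `KdNode`: it only launches the parent kernel and waits. The array is freed in a separate kernel rather than at the end of `buildTree` because the child grids may still be running when the parent thread reaches that point.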