Searching CUDA code autotuning spaces with hardware performance counters: data from benchmarks running on various GPU architectures