If you add "–save-temps" to invocation of create program (or whatever it is called) you will see the generated assembly files. You can modify these and can then create an opencl program from this assembly.
Another way to get the assembly output it to build llvm with support for amd gpu and then compile this to assembly using the -S flag.
The command line I used for polaris10, with opencl standard 2.0 was:
clang -std=CL2.0 -target amdgcn-amd-amdpal-opencl -mcpu=polaris10 -S -c foo.cl -O3
You need to modify foo.cl to include opencl-c.h, by adding
#include <opencl-c.h>
to the first line of the opencl source file.
This will then generate the assembly file as foo.s in the current directory. To generate code for an R9 fury, change the mcpu to fiji. ie.
... -mcpu=fiji ...