J. Choi, A. Singh, and R. Vuduc. Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, page 115--126. New York, NY, USA, ACM, (2010)
N. Bell, and M. Garland. Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, page 18:1--18:11. New York, NY, USA, ACM, (2009)