Building Faster AMD64 Memset Routines