Which version of keras was used for this implementation? Also, which backend was used, and which version of the backend?