Provide some background context, describe our approach, and compare it with the C/Python implementations. This could make a good Medium post.