Sorry, I have one question about the code. Why is the code that computes bottleneck_L2 the same as the code that computes transformer_L2 when the parameter only_part is False? Looking forward to your reply!