parameterized outer sum for pairwise rep, do pairwise attention layers for templates, use relative positional embeddings summed to pairwise rep as in paper
deprecate MDS in favor of using Graph Transformer for constituting trunk to initial set of coordinates for refinement, given new Allan Costa and Baker lab papers add graph transformer dep 0.3.2