Tensor Parallelism
Step 1 of 5
Column-parallel and row-parallel linear layers and attention head splitting, in pure PyTorch.
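
As a rough illustration of the column/row pairing named above, the sketch below simulates tensor parallelism inside one process: the first linear layer's weight is split by output features (column parallel), the second by input features (row parallel), and the per-rank partial outputs are summed. The sizes, the variable names, and the plain sum standing in for an all-reduce are assumptions made for this example, not code from the tutorial.

```python
import torch

torch.manual_seed(0)

world_size = 2  # hypothetical number of tensor-parallel ranks
batch, d_in, d_hidden, d_out = 4, 8, 16, 8

x = torch.randn(batch, d_in)
# Weights use the nn.Linear layout: (out_features, in_features).
w1 = torch.randn(d_hidden, d_in)
w2 = torch.randn(d_out, d_hidden)

# Unsharded reference: two linear layers with a ReLU in between.
reference = torch.relu(x @ w1.T) @ w2.T

# Column parallel: each rank owns a slice of the first layer's output features.
w1_shards = torch.chunk(w1, world_size, dim=0)
# Row parallel: each rank owns the matching slice of the second layer's input features.
w2_shards = torch.chunk(w2, world_size, dim=1)

partials = []
for rank in range(world_size):
    # Local hidden activation, shape (batch, d_hidden // world_size).
    # ReLU is elementwise, so it can be applied per shard without communication.
    h_local = torch.relu(x @ w1_shards[rank].T)
    # Partial output, shape (batch, d_out); the full result is the sum over ranks.
    partials.append(h_local @ w2_shards[rank].T)

# Stand-in for an all-reduce (sum) across ranks.
output = torch.stack(partials).sum(dim=0)

max_abs_error = (output - reference).abs().max().item()
print(f"max abs error vs. unsharded reference: {max_abs_error:.2e}")
```

In a real multi-process setup each rank would hold only its own shards and the final sum would be a collective such as `torch.distributed.all_reduce`. Attention heads are commonly split along the same lines: each rank keeps a subset of heads, with the Q/K/V projections sharded like the first layer here and the output projection like the second.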