Self-Attention vs. Multi-Head Self-Attention