https://medium.com/data-science-collective/understanding-transformer-attention-mechanism-ffed36e821bb