https://towardsdatascience.com/leveraging-apache-spark-to-execute-billions-of-operations-on-aws-s3-2f62930d19fd