https://medium.com/@ajay.nextstep/merging-too-many-small-files-into-fewer-large-files-using-apache-spark-in-datalake-ff9a32807056