Processing PDF data with Apache PDFbox and Apache Spark at scale on Databricks.

Processing PDF data with Apache PDFbox and Apache Spark at scale on Databricks.

4 years ago
Anonymous $drS9DEX_Sj

Processing PDF data with Apache PDFbox and Apache Spark at scale on Databricks.

Jul 25, 2021, 10:25am UTC
https://medium.com/@debusinha2009/processing-pdf-data-with-apache-pdfbox-and-apache-spark-at-scale-on-databricks-85b4f8daee78