spark

To ETL or not to ETL? The benefits and limitations of ETL Tools over data scripting languages

2019-06-11T09:27:13-06:00

ETL refers to "Extract, Transform, and Load". By this point, the concept of ETL is well-established within the data industry and there are a number of enterprise-proven "ETL Tools" available to assist organizations with the movement and transformation of data. An ETL Tool is simply software that's designed to help organizations move and transform data [...]

To ETL or not to ETL? The benefits and limitations of ETL Tools over data scripting languages2019-06-11T09:27:13-06:00

5 Things to Know about Databricks

2019-01-30T13:17:42-07:00

Databricks is now available in both AWS and Azure so it’s getting a lot of buzz! Let’s discuss 5 things you should know about Databricks before diving in. 1.     Databricks is a managed Spark-based service for working with data in a cluster Databricks is an enhanced version of Spark and is touted by the Databricks [...]

5 Things to Know about Databricks2019-01-30T13:17:42-07:00