aws

To ETL or not to ETL? The benefits and limitations of ETL Tools over data scripting languages

2019-06-11T09:27:13-06:00

ETL refers to "Extract, Transform, and Load". By this point, the concept of ETL is well-established within the data industry and there are a number of enterprise-proven "ETL Tools" available to assist organizations with the movement and transformation of data. An ETL Tool is simply software that's designed to help organizations move and transform data [...]

To ETL or not to ETL? The benefits and limitations of ETL Tools over data scripting languages2019-06-11T09:27:13-06:00

5 Things to Know about Databricks

2019-01-30T13:17:42-07:00

Databricks is now available in both AWS and Azure so it’s getting a lot of buzz! Let’s discuss 5 things you should know about Databricks before diving in. 1.     Databricks is a managed Spark-based service for working with data in a cluster Databricks is an enhanced version of Spark and is touted by the Databricks [...]

5 Things to Know about Databricks2019-01-30T13:17:42-07:00

What is a data warehouse and why do I need one?

2019-01-29T12:53:09-07:00

While the data warehouse isn’t a new concept, many are left wondering – what is a data warehouse? And why do I need one? I'm sad to tell you that when we say "data warehouse", we aren't referring to a physical building that stores data! Datalere’s official definition of a data warehouse is the following [...]

What is a data warehouse and why do I need one?2019-01-29T12:53:09-07:00

Natural Language Processing: What Is It & How Can It Be Used?

2019-01-29T14:03:11-07:00

Almost every firm, company, or agency has a collection of text data that is difficult to manage. Big blocks of text, in sentences or paragraphs, in Word documents or text files, can't be easily queried, searched, averaged, or summarized by database analysts like numeric fields stored in databases. Dedicating an employee to reading and interpreting every [...]

Natural Language Processing: What Is It & How Can It Be Used?2019-01-29T14:03:11-07:00

AWS EC2 Linux AMI with pyodbc

2018-07-08T20:40:56-06:00

AWS EC2 Linux AMI with pyodbc, psqlODBC, Microsoft ODBC Drivers Context: Using pyodbc within a small python application to pull tracking data from an Azure SQL Server DB, do some work, and then store the results into AWS RDS PostreSQL DB. Problem: Default AWS EC2 Linux AMI 2017.09.1 does not come with unixODBC 2.3.x which [...]

AWS EC2 Linux AMI with pyodbc2018-07-08T20:40:56-06:00

Datalere Presents Data Science, Cloud, Business Intelligence and Data Warehousing at PASS Summit

2018-02-20T08:54:22-07:00

Pacific North West, 4000 attendees, training, trends and worldwide industry experts. In a nut shell, the world’s largest conference for Microsoft Data Platform is happening this fall in Seattle. Did I mention that I am speaking at this event? The Professional Association of SQL Server is putting on its 19th Summit. This conference provides intense [...]

Datalere Presents Data Science, Cloud, Business Intelligence and Data Warehousing at PASS Summit2018-02-20T08:54:22-07:00