Blog

The Paradox of Agile Data Management

At phData, many of us come from a software development background and have witnessed the success of Agile Methodologies. Agile started in software development where it quickly gained popularity, but has also now made inroads into other realms. The concept of the “Agile Admin”, or as it’s better known, Devops, takes many of its core […]

Read More

Discussing On-Premise vs Off-Premise (aka Cloud) Hadoop

Introduction Companies making an investment in Hadoop have a ton of things to consider.  What problem are they trying to solve?  What Hadoop services are they looking to implement?  How are they going to support them?  Etc,. However, one of the biggest questions companies need to answer first is: are they going to run Hadoop […]

Read More

The Truth about SQL on Hadoop (part 3)

This is a multi-part blog post meant to be an exhaustive introduction to SQL-on-Hadoop. The first part in this series covered Storage Engines and Online Transaction Processing (OLTP). The next post covered Online Analytical Processing (OLAP) while this post will cover engine retrofits for Hadoop and choosing among the alternatives. Retrofits When breaking this topic […]

Read More

What Everybody Ought to Know About Big Data

In Underhyped – Big Data as an Advance in the Scientific Method Yanpei Chen makes the argument that big data is a fundamental advancement to the scientific method. This is an exceedingly bold claim and to be honest I suspected a strong dose of sensationalism. I mentioned this to a shared mutual acquaintance. I was informed that […]

Read More

The Truth about SQL on Hadoop (part 2)

This is a multi-part blog post meant to be an exhaustive introduction to SQL-on-Hadoop. The first part in this series covered Storage Engines and Online Transaction Processing (OLTP). This post will cover Online Analytical Processing (OLAP) while the third in the series will cover engine retrofits for Hadoop and choosing among the alternatives. Data processing and […]

Read More

The Truth about SQL on Hadoop (part 1)

This is a multi-part blog post meant to be an exhaustive introduction to SQL-on-Hadoop. The first part in this series will cover Storage Engines and Online Transaction Processing (OLTP). The next post will cover Online Analytical Processing (OLAP) while the third in the series will cover engine retrofits for Hadoop and choosing among the alternatives. […]

Read More