Hive Corruption Due to Newlines and Carriage Returns
phData has customers across the spectrum of use cases. One of our customers stores vast volumes of XML. One of our engineers was recently asked:
Hands-On Example with Hive Partitioning
Building off our Simple Examples Series, we wanted to take five minutes and show you how to recognize the power of partitioning. For a more
Binary Stream Ingest: Flume vs Kafka vs Kinesis
Introduction The Internet of Things will put new demands on Hadoop ingest methods, specifically in its ability to capture raw sensor data — binary streams.
Examples Using AVRO and ORC with Hive and Impala
Building off our first post on TEXTFILE and PARQUET, we decided to show examples with AVRO and ORC. AVRO is a row oriented format, while
Examples Using Textfile and Parquet with Hive and Impala
Experience the differences between TEXTFILE, PARQUET, Hive and Impala phData is a fan of simple examples. With that mindset, here is a very quick way