StreamSets

Hadoop meets Blockchain: Trust your (Big) Data

At a simple level, Blockchains solve a trust problem. Increasingly, companies are relying on third parties to help drive brand recognition and gain consumer trust, this includes trusting third party data.  For these companies to succeed it is vital that the data they receive is trustworthy and accurate. Each organization involved needs to trust that […]

Read More

Archiving Navigator Audit Data with StreamSets and Kafka

Andy Stadtler helped with this post Many of phData’s customers are heavy users of Cloudera Navigator. Cloudera Navigator provides metadata information to the user who can also audit all actions performed on data in the cluster. Per day one customer generates an average of 4GB Audit Data, which is stored by default in the mysql […]

Read More