Incremental Merge with Apache Spark
It is common to ingest a large amount of data into the Hadoop Distributed File System (HDFS) for analysis. And more often than not, we
Databricks Names phData 2020 Rising Star Award Winner
Databricks announced this week during their Partner Summit that phData has been named the 2020 Rising Star Award winner. With multiple joint customer wins over
Spark Job History Server OutOfMemoryError
One of phData’s customers hit an issue where the Spark Job History Server was running out of memory every few hours. The heap size was set