Spark Archives

Incremental Merge with Apache Spark

It is common to ingest a large amount of data into the Hadoop Distributed File System (HDFS) for analysis. And more often than not, we

Databricks Names phData 2020 Rising Star Award Winner

Databricks announced this week during their Partner Summit that phData has been named the 2020 Rising Star Award winner. With multiple joint customer wins over

Spark Job History Server OutOfMemoryError

One of phData’s customers hit an issue where the Spark Job History Server was running out of memory every few hours. The heap size was set

