How to Implement Association Rule Mining in Snowpark

Have you ever wondered how recommendations pop up as you click on products while shopping online? Think about that “frequently bought together” window you see when you go to check out. If so, you’re in luck. In this article, we introduce the algorithm commonly used by large e-commerce companies like Amazon to uncover associations between products, known […]

Demand Forecasting Models In Snowpark

Welcome back to our blog series on Snowpark, the latest product from the Snowflake Data Cloud. In this post, we highlight demand forecasting with Snowpark by applying the most popular time series forecasting model (ARIMA), implemented in Java. We will then use Snowpark to forecast in Snowflake the future demand […]

Snowflake ML: How to do Document Classification with Snowpark

Join us on this technical walkthrough as we determine the practicality of the Snowflake Data Cloud and Snowpark for machine learning use cases. Document Vectors With the success of word embeddings, it’s understood that entire documents can be represented in a similar way. In this case study, we will build a vector that represents a document […]

A Spark Developer’s Guide to Snowpark

As a Spark developer who uses the Snowflake Data Cloud, you’ve undoubtedly heard all the buzz around Snowpark. Having the ability to execute arbitrary Scala code in your Snowflake compute environment could be game-changing! You begin to wonder how this works in a practical way, and ask yourself the following questions: What is the architecture […]

What is Snowpark — and Why Does it Matter? A phData Perspective


With the explosion of data growth (and options to process data exploding along with it), we continually see customers standardize in a couple of key areas, regardless of their industry. Data and engineering teams are consistently choosing the Snowflake Data Cloud as the standard for data lakes, data warehouses, machine learning, and data strategy. […]

Machine Learning on Snowflake: Clustering Data with Snowpark

Next up in our blog series on Snowpark, we’ll discuss machine learning basics and K-Means clustering in Snowpark with an example. What is Machine Learning? Machine learning (ML) evolved from the study of pattern recognition and computational learning theory in artificial intelligence. ML uses algorithms that can learn from and make predictions on […]

Executing Machine Learning Models In Snowpark

Welcome back to our blog series on Snowpark, the latest product from the Snowflake Data Cloud. In this post, we aim to highlight the use of machine learning with Snowpark by applying the XGBoost algorithm to a dataset using scikit-learn (or sklearn) in Python and exporting the model to an open format called PMML, the […]

Complete Installation Guide of Snowpark on Linux

In this technical blog post, we’ll walk you through how to install Snowpark on Linux, specifically on Ubuntu 20.04 x86-64 with IntelliJ version 2021.1.1. The installation process is similar on other operating systems. In particular, once IntelliJ is installed, the process should be identical to […]

How to Install Snowpark Into IntelliJ IDEA

Welcome to our second blog in our series on Snowpark! This post will discuss the Snowflake Data Cloud’s new product, Snowpark, and its installation with IntelliJ IDEA. Let’s dive in! About Snowpark Snowpark is a new application programming interface (API) from Snowflake that allows developers, data engineers, and data scientists to programmatically perform data operations. […]

How to Connect to Snowpark With VSCode

Congratulations on your interest in Snowpark! To help you get started, we put together this handy tutorial to walk you through getting connected to your Snowflake account. We hope you find the following tutorial helpful on your journey to unlocking data engineering capabilities natively on the Snowflake Data Cloud. If you have any […]