Snowflake ML: How to do Document Classification with Snowpark

An abstract image of data and vectors

Join us on this technical walkthrough as we determine the practicality of the Snowflake Data Cloud and Snowpark for machine learning use-cases. Document Vectors With the success of word embeddings, it’s understood that entire documents can be represented in a similar way. In this case study, we will build a vector that represents a document […]

Is Snowflake Good for Machine Learning?

A picture of a brain to represent machine learning and the Snowflake data cloud logo

You may be wondering whether the Snowflake Data Cloud will help your organization with its machine learning (ML) initiatives.  The short answer to this question is a resounding “yes!” To fully answer this question, however, it’s important to recognize that most ML applications generally follow a common lifecycle. Snowflake is in fact great for ML because it […]

MLOps vs. DevOps: What is the Difference?

Machine learning is a term almost everyone in the IT space has heard by now—but it’s not just a buzzword used in flashy presentations anymore. As machine learning has started to become more applied and less theoretical, the industry has begun to incorporate it into important projects. We’re past the point of proving its value. […]

Bayesian Hyperparameter Optimization with MLflow

The number of boosting iterations proved to be the most significant hyperparameter in our search.

Bayesian hyperparameter optimization is a bread-and-butter task for data scientists and machine-learning engineers; basically, every model-development project requires it.  Hyperparameters are the parameters (variables) of machine-learning models that are not learned from data, but instead set explicitly prior to training – think of them as knobs that need to be fiddled with in order to […]

What Are Machine Learning Frameworks and How to Pick the Best One

There’s no question that artificial intelligence (AI) and machine learning (ML) technologies are already impacting nearly every sector across the globe. Plenty of companies are investing heavily in data science, ML, and AI initiatives to solve their business problems, which may give them an edge over their competitors. When we look at it from an […]

Accelerating ML: What Does MLOps Look Like In Practice?

The components of an MLOps framework sound great, but what do they actually provide to teams on a day-to-day basis? It’s possible to talk at length about feature stores, how to deploy models into production, and the costs involved with doing so, but instead let’s take a look at what MLOps really means to an […]

What is a Feature Store?

A simple diagram of what a feature store is

You’ve got machine learning questions, we’ve got machine learning answers. In this blog post, we’ll explore what a feature store is in the first place, explore a few of the key advantages (and disadvantages) of them, and touch on when is the right time for your organization to build or adopt a feature store. But […]

Federated Learning for Cyber Security: What You Need to Know in 2021

A picture of a number of locks that are all the same color of blue except for one lock that is unlocked and red

It’s nearly impossible to watch the news without hearing the term cybersecurity. Just recently, hackers breached the networks of thousands of private companies and government organizations by attaching malware to a single software update from a company called Solarwinds. This single breach granted access for several months and leaked an unprecedented amount of information, most […]

Machine Learning on Snowflake: Clustering Data with Snowpark

Next up in our blog series on Snowpark, we’ll discuss machine learning basics and K-Means clustering in Snowpark with an example. What is Machine Learning? Machine learning (ML) is established by the evolutionary study of pattern recognition and computational learning theory in artificial intelligence. ML uses algorithms that can learn from and make predictions on […]

Executing Machine Learning Models In Snowpark

Welcome back to our blog series on Snowpark, the latest product from the Snowflake data cloud. In this post, we aim to highlight the use of machine learning with Snowpark by applying the XGBoost algorithm to a dataset using scikit-learn (or sklearn) in Python and export the model to an open format called PMML, the […]