Skip to main content

SynapseML

Simple and Distributed Machine Learning

Coming from MMLSpark? We have been renamed to SynapseML!
from synapse.ml.cognitive import *
sentiment_df = (TextSentiment()
.setTextCol("text")
.setLocation("eastus")
.setSubscriptionKey(key)
.setOutputCol("sentiment")
.setErrorCol("error")
.setLanguageCol("language")
.transform(input_df))
Read more
Simple

Simple

Quickly create, train, and use distributed machine learning tools in only a few lines of code.

Scalable

Scalable

Scale ML workloads to hundreds of machines on your Apache Spark cluster.

Multilingual

Multilingual

Use SynapseML from any Spark compatible language including Python, Scala, R, Java, .NET and C#.

Open

Open

SynapseML is Open Source and can be installed and used on any Spark 3 infrastructure including your local machine, Databricks, Synapse Analytics, and others.

Installation

Written in Scala, and support multiple languages. Open source and cloud native.

SynapseML is preinstalled on Fabric. To install a different version, add the following to the first cell of a notebook:

%%configure -f
{
"name": "synapseml",
"conf": {
"spark.jars.packages": "com.microsoft.azure:synapseml_2.12:[THE_SYNAPSEML_VERSION_YOU_WANT]",
"spark.jars.repositories": "https://mmlspark.azureedge.net/maven",
"spark.jars.excludes": "org.scala-lang:scala-reflect,org.apache.spark:spark-tags_2.12,org.scalactic:scalactic_2.12,org.scalatest:scalatest_2.12,com.fasterxml.jackson.core:jackson-databind",
"spark.yarn.user.classpath.first": "true",
"spark.sql.parquet.enableVectorizedReader": "false"
}
}