SANSA-ML

SANSA-ML is the Machine Learning (ML) library in the SANSA stack (see http://sansa-stack.net). Algorithms in this repository perform various machine learning tasks directly on RDF/OWL input data. While most machine learning algorithms are based on processing simple features, the algorithms in SANSA-ML exploit the graph structure and semantics of the background knowledge specified using the RDF and OWL standards. In many cases, this yields results that are either more accurate or easier for humans to understand. In contrast to most other algorithms that support background knowledge, they scale horizontally using Apache Spark.

The ML layer currently supports the following algorithms:

RDF graph clustering
Rule mining in RDF graphs based on AMIE+

Usage example for clustering:

RDFByModularityClustering(sparkSession.sparkContext, numIterations, input, output)
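
As the name of the call above suggests, this clustering is driven by modularity, a score that rewards partitions with many edges inside clusters and few between them. The following is a minimal, Spark-free sketch of that score on a toy graph, not the SANSA-ML implementation. The `Triple` case class, the `ModularitySketch` object and the toy data are assumptions made for illustration, and triples are treated as undirected subject-object edges.

```scala
// Toy illustration only: not SANSA-ML code and not distributed.
object ModularitySketch {
  // An RDF triple reduced to plain strings (hypothetical helper type).
  final case class Triple(s: String, p: String, o: String)

  // Treat each triple as an undirected edge between subject and object.
  def edges(triples: Seq[Triple]): Seq[(String, String)] =
    triples.map(t => (t.s, t.o))

  // Newman modularity: Q = sum over clusters c of (e_c / m - (d_c / (2m))^2),
  // where m is the number of edges, e_c the number of edges inside c,
  // and d_c the total degree of the nodes in c.
  def modularity(es: Seq[(String, String)], cluster: Map[String, Int]): Double = {
    val m = es.size.toDouble
    // Number of edges whose two endpoints share a cluster, per cluster.
    val inside: Map[Int, Double] = es
      .filter { case (a, b) => cluster(a) == cluster(b) }
      .groupBy { case (a, _) => cluster(a) }
      .map { case (c, xs) => c -> xs.size.toDouble }
    // Sum of node degrees per cluster (each edge contributes to both endpoints).
    val degree: Map[Int, Double] = es
      .flatMap { case (a, b) => Seq(a, b) }
      .groupBy(node => cluster(node))
      .map { case (c, xs) => c -> xs.size.toDouble }
    degree.keys.toSeq.map { c =>
      inside.getOrElse(c, 0.0) / m - math.pow(degree(c) / (2 * m), 2)
    }.sum
  }

  // Two triangles (a, b, c) and (d, e, f) joined by the single edge c-d.
  val toy: Seq[Triple] = Seq(
    Triple("a", "knows", "b"), Triple("b", "knows", "c"), Triple("a", "knows", "c"),
    Triple("d", "knows", "e"), Triple("e", "knows", "f"), Triple("d", "knows", "f"),
    Triple("c", "knows", "d")
  )

  val twoClusters: Map[String, Int] =
    Map("a" -> 0, "b" -> 0, "c" -> 0, "d" -> 1, "e" -> 1, "f" -> 1)
  val oneCluster: Map[String, Int] = twoClusters.map { case (n, _) => n -> 0 }

  def main(args: Array[String]): Unit = {
    println(s"two clusters: Q = ${modularity(edges(toy), twoClusters)}")
    println(s"one cluster:  Q = ${modularity(edges(toy), oneCluster)}")
  }
}
```

On this toy graph, splitting along the two triangles gives Q = 5/14 (about 0.357), while putting every node into one cluster gives Q = 0, so a modularity-based method would prefer the split.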

Please see https://github.com/SANSA-Stack/SANSA-Examples/tree/master/sansa-examples-spark/src/main/scala/net/sansa_stack/examples/spark/ml for further examples.
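
For the rule mining entry in the list above, it may help to see the measures that AMIE-style miners use to rank a candidate rule: support, standard confidence, and confidence under the partial completeness assumption (PCA). The sketch below scores one fixed rule of the shape body(x, y) => head(x, y) on a toy knowledge base. It is only an illustration of these measures, not the AMIE+ search procedure and not SANSA-ML code. The `Triple` case class, the `RuleConfidenceSketch` object and the example facts are assumptions.

```scala
// Toy illustration only: not SANSA-ML code and not the AMIE+ search itself.
object RuleConfidenceSketch {
  // An RDF triple reduced to plain strings (hypothetical helper type).
  final case class Triple(s: String, p: String, o: String)

  // All (subject, object) pairs connected by predicate p.
  def pairs(kb: Set[Triple], p: String): Set[(String, String)] =
    kb.collect { case Triple(s, pred, o) if pred == p => (s, o) }

  // Scores the single-atom rule body(x, y) => head(x, y).
  // support             = #(x, y) with body(x, y) and head(x, y)
  // standard confidence = support / #(x, y) with body(x, y)
  // PCA confidence      = support / #(x, y) with body(x, y) and head(x, y') for some y'
  def score(kb: Set[Triple], body: String, head: String): (Int, Double, Double) = {
    val b = pairs(kb, body)
    val h = pairs(kb, head)
    val support = (b intersect h).size
    // Subjects for which the KB knows at least one head fact.
    val headSubjects = h.map(_._1)
    val pcaDenominator = b.count { case (x, _) => headSubjects.contains(x) }
    val std = if (b.isEmpty) 0.0 else support.toDouble / b.size
    val pca = if (pcaDenominator == 0) 0.0 else support.toDouble / pcaDenominator
    (support, std, pca)
  }

  // Hypothetical facts: carol has a birthplace but no known residence.
  val kb: Set[Triple] = Set(
    Triple("alice", "bornIn", "paris"),
    Triple("bob", "bornIn", "berlin"),
    Triple("carol", "bornIn", "rome"),
    Triple("alice", "livesIn", "paris"),
    Triple("bob", "livesIn", "munich")
  )

  def main(args: Array[String]): Unit = {
    val (support, std, pca) = score(kb, "bornIn", "livesIn")
    println(s"support=$support stdConfidence=$std pcaConfidence=$pca")
  }
}
```

Here the rule bornIn(x, y) => livesIn(x, y) has support 1, standard confidence 1/3 and PCA confidence 1/2. The PCA figure is higher because carol, whose residence is simply unknown, is not counted as a counterexample.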

Several further algorithms are in development. Please create a pull request and/or contact Jens Lehmann if you are interested in contributing algorithms to SANSA-ML.

Support for Apache Flink is planned for future releases.

