Back to Spark

MLlib (RDD-based)

python/docs/source/reference/pyspark.mllib.rst

4.1.14.6 KB
Original Source

.. Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF licenses this file to you under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

.. http://www.apache.org/licenses/LICENSE-2.0

.. Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

MLlib (RDD-based)

Classification

.. currentmodule:: pyspark.mllib.classification

.. autosummary:: :template: autosummary/class_with_docs.rst :toctree: api/

LogisticRegressionModel
LogisticRegressionWithSGD
LogisticRegressionWithLBFGS
SVMModel
SVMWithSGD
NaiveBayesModel
NaiveBayes
StreamingLogisticRegressionWithSGD

Clustering

.. currentmodule:: pyspark.mllib.clustering

.. autosummary:: :template: autosummary/class_with_docs.rst :toctree: api/

BisectingKMeansModel
BisectingKMeans
KMeansModel
KMeans
GaussianMixtureModel
GaussianMixture
PowerIterationClusteringModel
PowerIterationClustering
StreamingKMeans
StreamingKMeansModel
LDA
LDAModel

Evaluation

.. currentmodule:: pyspark.mllib.evaluation

.. autosummary:: :template: autosummary/class_with_docs.rst :toctree: api/

BinaryClassificationMetrics
RegressionMetrics
MulticlassMetrics
RankingMetrics

Feature

.. currentmodule:: pyspark.mllib.feature

.. autosummary:: :template: autosummary/class_with_docs.rst :toctree: api/

Normalizer
StandardScalerModel
StandardScaler
HashingTF
IDFModel
IDF
Word2Vec
Word2VecModel
ChiSqSelector
ChiSqSelectorModel
ElementwiseProduct

Frequency Pattern Mining

.. currentmodule:: pyspark.mllib.fpm

.. autosummary:: :template: autosummary/class_with_docs.rst :toctree: api/

FPGrowth
FPGrowthModel
PrefixSpan
PrefixSpanModel

Vector and Matrix

.. currentmodule:: pyspark.mllib.linalg

.. autosummary:: :template: autosummary/class_with_docs.rst :toctree: api/

Vector
DenseVector
SparseVector
Vectors
Matrix
DenseMatrix
SparseMatrix
Matrices
QRDecomposition

Distributed Representation


.. currentmodule:: pyspark.mllib.linalg.distributed

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    BlockMatrix
    CoordinateMatrix
    DistributedMatrix
    IndexedRow
    IndexedRowMatrix
    MatrixEntry
    RowMatrix
    SingularValueDecomposition


Random
------

.. currentmodule:: pyspark.mllib.random

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    RandomRDDs


Recommendation
--------------

.. currentmodule:: pyspark.mllib.recommendation

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    MatrixFactorizationModel
    ALS
    Rating


Regression
----------

.. currentmodule:: pyspark.mllib.regression

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    LabeledPoint
    LinearModel
    LinearRegressionModel
    LinearRegressionWithSGD
    RidgeRegressionModel
    RidgeRegressionWithSGD
    LassoModel
    LassoWithSGD
    IsotonicRegressionModel
    IsotonicRegression
    StreamingLinearAlgorithm
    StreamingLinearRegressionWithSGD


Statistics
----------

.. currentmodule:: pyspark.mllib.stat

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    Statistics
    MultivariateStatisticalSummary
    ChiSqTestResult
    MultivariateGaussian
    KernelDensity
    ChiSqTestResult
    KolmogorovSmirnovTestResult


Tree
----

.. currentmodule:: pyspark.mllib.tree

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    DecisionTreeModel
    DecisionTree
    RandomForestModel
    RandomForest
    GradientBoostedTreesModel
    GradientBoostedTrees


Utilities
---------

.. currentmodule:: pyspark.mllib.util

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    JavaLoader
    JavaSaveable
    LinearDataGenerator
    Loader
    MLUtils
    Saveable