Packages

  • package root
    Definition Classes
    root
  • package org
    Definition Classes
    root
  • package apache
    Definition Classes
    org
  • package spark
    Definition Classes
    apache
  • package mllib

    RDD-based machine learning APIs (in maintenance mode).

    RDD-based machine learning APIs (in maintenance mode).

    The spark.mllib package is in maintenance mode as of the Spark 2.0.0 release to encourage migration to the DataFrame-based APIs under the org.apache.spark.ml package. While in maintenance mode,

    • no new features in the RDD-based spark.mllib package will be accepted, unless they block implementing new features in the DataFrame-based spark.ml package;
    • bug fixes in the RDD-based APIs will still be accepted.

    The developers will continue adding more features to the DataFrame-based APIs in the 2.x series to reach feature parity with the RDD-based APIs. And once we reach feature parity, this package will be deprecated.

    Definition Classes
    spark
    See also

    SPARK-4591 to track the progress of feature parity

  • package classification
    Definition Classes
    mllib
  • package clustering
    Definition Classes
    mllib
  • package evaluation
    Definition Classes
    mllib
  • package feature
    Definition Classes
    mllib
  • package fpm
    Definition Classes
    mllib
  • package linalg
    Definition Classes
    mllib
  • package optimization
    Definition Classes
    mllib
  • package pmml
    Definition Classes
    mllib
  • package random
    Definition Classes
    mllib
  • package rdd
    Definition Classes
    mllib
  • MLPairRDDFunctions
  • RDDFunctions
  • package recommendation
    Definition Classes
    mllib
  • package regression
    Definition Classes
    mllib
  • package stat
    Definition Classes
    mllib
  • package tree

    This package contains the default implementation of the decision tree algorithm, which supports:

    This package contains the default implementation of the decision tree algorithm, which supports:

    • binary classification,
    • regression,
    • information loss calculation with entropy and Gini for classification and variance for regression,
    • both continuous and categorical features.
    Definition Classes
    mllib
  • package util
    Definition Classes
    mllib

package rdd

Ordering
  1. Alphabetic
Visibility
  1. Public
  2. All

Type Members

  1. class MLPairRDDFunctions[K, V] extends Serializable

    Machine learning specific Pair RDD functions.

  2. class RDDFunctions[T] extends Serializable

    Machine learning specific RDD functions.

Value Members

  1. object MLPairRDDFunctions extends Serializable
  2. object RDDFunctions extends Serializable

Ungrouped