BinaryLogisticRegressionSummary#
- class pyspark.ml.classification.BinaryLogisticRegressionSummary(java_obj=None)[source]#
- Binary Logistic regression results for a given model. - New in version 2.0.0. - Methods - fMeasureByLabel([beta])- Returns f-measure for each label (category). - weightedFMeasure([beta])- Returns weighted averaged f-measure. - Attributes - Returns accuracy. - Computes the area under the receiver operating characteristic (ROC) curve. - Returns a dataframe with two fields (threshold, F-Measure) curve with beta = 1.0. - Returns false positive rate for each label (category). - Field in "predictions" which gives the features of each instance as a vector. - Field in "predictions" which gives the true label of each instance. - Returns the sequence of labels in ascending order. - Returns the precision-recall curve, which is a Dataframe containing two fields recall, precision with (0.0, 1.0) prepended to it. - Returns precision for each label (category). - Returns a dataframe with two fields (threshold, precision) curve. - Field in "predictions" which gives the prediction of each class. - Dataframe outputted by the model's transform method. - Field in "predictions" which gives the probability of each class as a vector. - Returns recall for each label (category). - Returns a dataframe with two fields (threshold, recall) curve. - Returns the receiver operating characteristic (ROC) curve, which is a Dataframe having two fields (FPR, TPR) with (0.0, 0.0) prepended and (1.0, 1.0) appended to it. - Field in "predictions" which gives the probability or raw prediction of each class as a vector. - Returns true positive rate for each label (category). - Field in "predictions" which gives the weight of each instance as a vector. - Returns weighted false positive rate. - Returns weighted averaged precision. - Returns weighted averaged recall. - Returns weighted true positive rate. - Methods Documentation - fMeasureByLabel(beta=1.0)#
- Returns f-measure for each label (category). - New in version 3.1.0. 
 - weightedFMeasure(beta=1.0)#
- Returns weighted averaged f-measure. - New in version 3.1.0. 
 - Attributes Documentation - accuracy#
- Returns accuracy. (equals to the total number of correctly classified instances out of the total number of instances.) - New in version 3.1.0. 
 - areaUnderROC#
- Computes the area under the receiver operating characteristic (ROC) curve. - New in version 3.1.0. 
 - fMeasureByThreshold#
- Returns a dataframe with two fields (threshold, F-Measure) curve with beta = 1.0. - New in version 3.1.0. 
 - falsePositiveRateByLabel#
- Returns false positive rate for each label (category). - New in version 3.1.0. 
 - featuresCol#
- Field in “predictions” which gives the features of each instance as a vector. - New in version 2.0.0. 
 - labelCol#
- Field in “predictions” which gives the true label of each instance. - New in version 3.1.0. 
 - labels#
- Returns the sequence of labels in ascending order. This order matches the order used in metrics which are specified as arrays over labels, e.g., truePositiveRateByLabel. - New in version 3.1.0. - Notes - In most cases, it will be values {0.0, 1.0, …, numClasses-1}, However, if the training set is missing a label, then all of the arrays over labels (e.g., from truePositiveRateByLabel) will be of length numClasses-1 instead of the expected numClasses. 
 - pr#
- Returns the precision-recall curve, which is a Dataframe containing two fields recall, precision with (0.0, 1.0) prepended to it. - New in version 3.1.0. 
 - precisionByLabel#
- Returns precision for each label (category). - New in version 3.1.0. 
 - precisionByThreshold#
- Returns a dataframe with two fields (threshold, precision) curve. Every possible probability obtained in transforming the dataset are used as thresholds used in calculating the precision. - New in version 3.1.0. 
 - predictionCol#
- Field in “predictions” which gives the prediction of each class. - New in version 3.1.0. 
 - predictions#
- Dataframe outputted by the model’s transform method. - New in version 3.1.0. 
 - probabilityCol#
- Field in “predictions” which gives the probability of each class as a vector. - New in version 2.0.0. 
 - recallByLabel#
- Returns recall for each label (category). - New in version 3.1.0. 
 - recallByThreshold#
- Returns a dataframe with two fields (threshold, recall) curve. Every possible probability obtained in transforming the dataset are used as thresholds used in calculating the recall. - New in version 3.1.0. 
 - roc#
- Returns the receiver operating characteristic (ROC) curve, which is a Dataframe having two fields (FPR, TPR) with (0.0, 0.0) prepended and (1.0, 1.0) appended to it. - New in version 3.1.0. - Notes 
 - scoreCol#
- Field in “predictions” which gives the probability or raw prediction of each class as a vector. - New in version 3.1.0. 
 - truePositiveRateByLabel#
- Returns true positive rate for each label (category). - New in version 3.1.0. 
 - weightCol#
- Field in “predictions” which gives the weight of each instance as a vector. - New in version 3.1.0. 
 - weightedFalsePositiveRate#
- Returns weighted false positive rate. - New in version 3.1.0. 
 - weightedPrecision#
- Returns weighted averaged precision. - New in version 3.1.0. 
 - weightedRecall#
- Returns weighted averaged recall. (equals to precision, recall and f-measure) - New in version 3.1.0. 
 - weightedTruePositiveRate#
- Returns weighted true positive rate. (equals to precision, recall and f-measure) - New in version 3.1.0.