
Commit 0d92612

Pushing the docs to dev/ for branch: main, commit 610d4f7ae65fe8aa0bd923e8f8b00a0eb9600594
1 parent 75f330d commit 0d92612

File tree

1,261 files changed: +6214 / -6151 lines


dev/_downloads/21a6ff17ef2837fe1cd49e63223a368d/plot_unveil_tree_structure.py

Lines changed: 25 additions & 10 deletions
@@ -68,7 +68,8 @@
 # - ``weighted_n_node_samples[i]``: the weighted number of training samples
 #   reaching node ``i``
 # - ``value[i, j, k]``: the summary of the training samples that reached node i for
-#   output j and class k (for regression tree, class is set to 1).
+#   output j and class k (for regression tree, class is set to 1). See below
+#   for more information about ``value``.
 #
 # Using the arrays, we can traverse the tree structure to compute various
 # properties. Below, we will compute the depth of each node and whether or not
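For readers following along, a minimal sketch of inspecting these parallel arrays on a fitted tree. It assumes the iris setup this example uses in its unchanged earlier lines (`max_leaf_nodes=3, random_state=0`), which is not shown in the hunk above:

# Minimal sketch (assumes the example's own setup, not shown in this hunk):
# fit a small tree on iris and inspect the parallel arrays of `tree_`.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
clf = DecisionTreeClassifier(max_leaf_nodes=3, random_state=0).fit(X_train, y_train)

tree_ = clf.tree_
print(tree_.node_count)                      # total number of nodes
print(tree_.children_left)                   # -1 marks a leaf node
print(tree_.children_right)
print(tree_.feature[0], tree_.threshold[0])  # split feature and threshold at the root
print(tree_.value[0])                        # summary of training samples reaching the root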
@@ -108,7 +109,7 @@
     if is_leaves[i]:
         print(
             "{space}node={node} is a leaf node with value={value}.".format(
-                space=node_depth[i] * "\t", node=i, value=values[i]
+                space=node_depth[i] * "\t", node=i, value=np.around(values[i], 3)
             )
         )
     else:
@@ -122,24 +123,36 @@
                 feature=feature[i],
                 threshold=threshold[i],
                 right=children_right[i],
-                value=values[i],
+                value=np.around(values[i], 3),
             )
         )

 # %%
 # What is the values array used here?
 # -----------------------------------
 # The `tree_.value` array is a 3D array of shape
-# [``n_nodes``, ``n_classes``, ``n_outputs``] which provides the count of samples
-# reaching a node for each class and for each output. Each node has a ``value``
-# array which is the number of weighted samples reaching this
-# node for each output and class.
+# [``n_nodes``, ``n_classes``, ``n_outputs``] which provides the proportion of samples
+# reaching a node for each class and for each output.
+# Each node has a ``value`` array which is the proportion of weighted samples reaching
+# this node for each output and class with respect to the parent node.
+#
+# One could convert this to the absolute weighted number of samples reaching a node,
+# by multiplying this number by `tree_.weighted_n_node_samples[node_idx]` for the
+# given node. Note sample weights are not used in this example, so the weighted
+# number of samples is the number of samples reaching the node because each sample
+# has a weight of 1 by default.
 #
 # For example, in the above tree built on the iris dataset, the root node has
-# ``value = [37, 34, 41]``, indicating there are 37 samples
+# ``value = [0.33, 0.304, 0.366]`` indicating there are 33% of class 0 samples,
+# 30.4% of class 1 samples, and 36.6% of class 2 samples at the root node. One can
+# convert this to the absolute number of samples by multiplying by the number of
+# samples reaching the root node, which is `tree_.weighted_n_node_samples[0]`.
+# Then the root node has ``value = [37, 34, 41]``, indicating there are 37 samples
 # of class 0, 34 samples of class 1, and 41 samples of class 2 at the root node.
+#
 # Traversing the tree, the samples are split and as a result, the ``value`` array
-# reaching each node changes. The left child of the root node has ``value = [37, 0, 0]``
+# reaching each node changes. The left child of the root node has ``value = [1., 0, 0]``
+# (or ``value = [37, 0, 0]`` when converted to the absolute number of samples)
 # because all 37 samples in the left child node are from class 0.
 #
 # Note: In this example, `n_outputs=1`, but the tree classifier can also handle
@@ -148,8 +161,10 @@

 ##############################################################################
 # We can compare the above output to the plot of the decision tree.
+# Here, we show the proportions of samples of each class that reach each
+# node corresponding to the actual elements of `tree_.value` array.

-tree.plot_tree(clf)
+tree.plot_tree(clf, proportion=True)
 plt.show()

 ##############################################################################
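The proportion-to-count conversion described in the hunks above can be checked directly. A minimal sketch, assuming the example's iris setup and a scikit-learn version where `tree_.value` stores class proportions (the behavior this commit documents):

# Minimal sketch: recover absolute sample counts from the proportions in
# `tree_.value` (assumes a scikit-learn version where `value` holds
# per-class proportions, as documented by this commit).
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
clf = DecisionTreeClassifier(max_leaf_nodes=3, random_state=0).fit(X_train, y_train)

proportions = clf.tree_.value[0]                # class proportions at the root
n_root = clf.tree_.weighted_n_node_samples[0]   # weighted samples reaching the root
print(np.around(proportions, 3))                # e.g. [[0.33  0.304 0.366]]
print(np.around(proportions * n_root, 0))       # absolute counts, e.g. [[37. 34. 41.]]

These proportions are also what `tree.plot_tree(clf, proportion=True)` renders in each node box, which is why the example switches that parameter on.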

dev/_downloads/f7a387851c5762610f4e8197e52bbbca/plot_unveil_tree_structure.ipynb

Lines changed: 5 additions & 5 deletions
@@ -40,7 +40,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Tree structure\n\nThe decision classifier has an attribute called ``tree_`` which allows access\nto low level attributes such as ``node_count``, the total number of nodes,\nand ``max_depth``, the maximal depth of the tree. The\n``tree_.compute_node_depths()`` method computes the depth of each node in the\ntree. `tree_` also stores the entire binary tree structure, represented as a\nnumber of parallel arrays. The i-th element of each array holds information\nabout the node ``i``. Node 0 is the tree's root. Some of the arrays only\napply to either leaves or split nodes. In this case the values of the nodes\nof the other type is arbitrary. For example, the arrays ``feature`` and\n``threshold`` only apply to split nodes. The values for leaf nodes in these\narrays are therefore arbitrary.\n\nAmong these arrays, we have:\n\n - ``children_left[i]``: id of the left child of node ``i`` or -1 if leaf\n node\n - ``children_right[i]``: id of the right child of node ``i`` or -1 if leaf\n node\n - ``feature[i]``: feature used for splitting node ``i``\n - ``threshold[i]``: threshold value at node ``i``\n - ``n_node_samples[i]``: the number of training samples reaching node\n ``i``\n - ``impurity[i]``: the impurity at node ``i``\n - ``weighted_n_node_samples[i]``: the weighted number of training samples\n reaching node ``i``\n - ``value[i, j, k]``: the summary of the training samples that reached node i for\n output j and class k (for regression tree, class is set to 1).\n\nUsing the arrays, we can traverse the tree structure to compute various\nproperties. Below, we will compute the depth of each node and whether or not\nit is a leaf.\n\n"
+    "## Tree structure\n\nThe decision classifier has an attribute called ``tree_`` which allows access\nto low level attributes such as ``node_count``, the total number of nodes,\nand ``max_depth``, the maximal depth of the tree. The\n``tree_.compute_node_depths()`` method computes the depth of each node in the\ntree. `tree_` also stores the entire binary tree structure, represented as a\nnumber of parallel arrays. The i-th element of each array holds information\nabout the node ``i``. Node 0 is the tree's root. Some of the arrays only\napply to either leaves or split nodes. In this case the values of the nodes\nof the other type is arbitrary. For example, the arrays ``feature`` and\n``threshold`` only apply to split nodes. The values for leaf nodes in these\narrays are therefore arbitrary.\n\nAmong these arrays, we have:\n\n - ``children_left[i]``: id of the left child of node ``i`` or -1 if leaf\n node\n - ``children_right[i]``: id of the right child of node ``i`` or -1 if leaf\n node\n - ``feature[i]``: feature used for splitting node ``i``\n - ``threshold[i]``: threshold value at node ``i``\n - ``n_node_samples[i]``: the number of training samples reaching node\n ``i``\n - ``impurity[i]``: the impurity at node ``i``\n - ``weighted_n_node_samples[i]``: the weighted number of training samples\n reaching node ``i``\n - ``value[i, j, k]``: the summary of the training samples that reached node i for\n output j and class k (for regression tree, class is set to 1). See below\n for more information about ``value``.\n\nUsing the arrays, we can traverse the tree structure to compute various\nproperties. Below, we will compute the depth of each node and whether or not\nit is a leaf.\n\n"
    ]
   },
   {
@@ -51,21 +51,21 @@
   },
    "outputs": [],
    "source": [
-    "n_nodes = clf.tree_.node_count\nchildren_left = clf.tree_.children_left\nchildren_right = clf.tree_.children_right\nfeature = clf.tree_.feature\nthreshold = clf.tree_.threshold\nvalues = clf.tree_.value\n\nnode_depth = np.zeros(shape=n_nodes, dtype=np.int64)\nis_leaves = np.zeros(shape=n_nodes, dtype=bool)\nstack = [(0, 0)] # start with the root node id (0) and its depth (0)\nwhile len(stack) > 0:\n # `pop` ensures each node is only visited once\n node_id, depth = stack.pop()\n node_depth[node_id] = depth\n\n # If the left and right child of a node is not the same we have a split\n # node\n is_split_node = children_left[node_id] != children_right[node_id]\n # If a split node, append left and right children and depth to `stack`\n # so we can loop through them\n if is_split_node:\n stack.append((children_left[node_id], depth + 1))\n stack.append((children_right[node_id], depth + 1))\n else:\n is_leaves[node_id] = True\n\nprint(\n \"The binary tree structure has {n} nodes and has \"\n \"the following tree structure:\\n\".format(n=n_nodes)\n)\nfor i in range(n_nodes):\n if is_leaves[i]:\n print(\n \"{space}node={node} is a leaf node with value={value}.\".format(\n space=node_depth[i] * \"\\t\", node=i, value=values[i]\n )\n )\n else:\n print(\n \"{space}node={node} is a split node with value={value}: \"\n \"go to node {left} if X[:, {feature}] <= {threshold} \"\n \"else to node {right}.\".format(\n space=node_depth[i] * \"\\t\",\n node=i,\n left=children_left[i],\n feature=feature[i],\n threshold=threshold[i],\n right=children_right[i],\n value=values[i],\n )\n )"
+    "n_nodes = clf.tree_.node_count\nchildren_left = clf.tree_.children_left\nchildren_right = clf.tree_.children_right\nfeature = clf.tree_.feature\nthreshold = clf.tree_.threshold\nvalues = clf.tree_.value\n\nnode_depth = np.zeros(shape=n_nodes, dtype=np.int64)\nis_leaves = np.zeros(shape=n_nodes, dtype=bool)\nstack = [(0, 0)] # start with the root node id (0) and its depth (0)\nwhile len(stack) > 0:\n # `pop` ensures each node is only visited once\n node_id, depth = stack.pop()\n node_depth[node_id] = depth\n\n # If the left and right child of a node is not the same we have a split\n # node\n is_split_node = children_left[node_id] != children_right[node_id]\n # If a split node, append left and right children and depth to `stack`\n # so we can loop through them\n if is_split_node:\n stack.append((children_left[node_id], depth + 1))\n stack.append((children_right[node_id], depth + 1))\n else:\n is_leaves[node_id] = True\n\nprint(\n \"The binary tree structure has {n} nodes and has \"\n \"the following tree structure:\\n\".format(n=n_nodes)\n)\nfor i in range(n_nodes):\n if is_leaves[i]:\n print(\n \"{space}node={node} is a leaf node with value={value}.\".format(\n space=node_depth[i] * \"\\t\", node=i, value=np.around(values[i], 3)\n )\n )\n else:\n print(\n \"{space}node={node} is a split node with value={value}: \"\n \"go to node {left} if X[:, {feature}] <= {threshold} \"\n \"else to node {right}.\".format(\n space=node_depth[i] * \"\\t\",\n node=i,\n left=children_left[i],\n feature=feature[i],\n threshold=threshold[i],\n right=children_right[i],\n value=np.around(values[i], 3),\n )\n )"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## What is the values array used here?\nThe `tree_.value` array is a 3D array of shape\n[``n_nodes``, ``n_classes``, ``n_outputs``] which provides the count of samples\nreaching a node for each class and for each output. Each node has a ``value``\narray which is the number of weighted samples reaching this\nnode for each output and class.\n\nFor example, in the above tree built on the iris dataset, the root node has\n``value = [37, 34, 41]``, indicating there are 37 samples\nof class 0, 34 samples of class 1, and 41 samples of class 2 at the root node.\nTraversing the tree, the samples are split and as a result, the ``value`` array\nreaching each node changes. The left child of the root node has ``value = [37, 0, 0]``\nbecause all 37 samples in the left child node are from class 0.\n\nNote: In this example, `n_outputs=1`, but the tree classifier can also handle\nmulti-output problems. The `value` array at each node would just be a 2D\narray instead.\n\n"
+    "## What is the values array used here?\nThe `tree_.value` array is a 3D array of shape\n[``n_nodes``, ``n_classes``, ``n_outputs``] which provides the proportion of samples\nreaching a node for each class and for each output.\nEach node has a ``value`` array which is the proportion of weighted samples reaching\nthis node for each output and class with respect to the parent node.\n\nOne could convert this to the absolute weighted number of samples reaching a node,\nby multiplying this number by `tree_.weighted_n_node_samples[node_idx]` for the\ngiven node. Note sample weights are not used in this example, so the weighted\nnumber of samples is the number of samples reaching the node because each sample\nhas a weight of 1 by default.\n\nFor example, in the above tree built on the iris dataset, the root node has\n``value = [0.33, 0.304, 0.366]`` indicating there are 33% of class 0 samples,\n30.4% of class 1 samples, and 36.6% of class 2 samples at the root node. One can\nconvert this to the absolute number of samples by multiplying by the number of\nsamples reaching the root node, which is `tree_.weighted_n_node_samples[0]`.\nThen the root node has ``value = [37, 34, 41]``, indicating there are 37 samples\nof class 0, 34 samples of class 1, and 41 samples of class 2 at the root node.\n\nTraversing the tree, the samples are split and as a result, the ``value`` array\nreaching each node changes. The left child of the root node has ``value = [1., 0, 0]``\n(or ``value = [37, 0, 0]`` when converted to the absolute number of samples)\nbecause all 37 samples in the left child node are from class 0.\n\nNote: In this example, `n_outputs=1`, but the tree classifier can also handle\nmulti-output problems. The `value` array at each node would just be a 2D\narray instead.\n\n"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "We can compare the above output to the plot of the decision tree.\n\n"
+    "We can compare the above output to the plot of the decision tree.\nHere, we show the proportions of samples of each class that reach each\nnode corresponding to the actual elements of `tree_.value` array.\n\n"
    ]
   },
   {
@@ -76,7 +76,7 @@
   },
    "outputs": [],
    "source": [
-    "tree.plot_tree(clf)\nplt.show()"
+    "tree.plot_tree(clf, proportion=True)\nplt.show()"
    ]
   },
   {

dev/_downloads/scikit-learn-docs.zip

Binary file not shown (size change: 2.56 KB).

dev/_sources/auto_examples/applications/plot_cyclical_feature_engineering.rst.txt

Lines changed: 1 addition & 1 deletion

dev/_sources/auto_examples/applications/plot_digits_denoising.rst.txt

Lines changed: 1 addition & 1 deletion

dev/_sources/auto_examples/applications/plot_face_recognition.rst.txt

Lines changed: 5 additions & 5 deletions

dev/_sources/auto_examples/applications/plot_model_complexity_influence.rst.txt

Lines changed: 15 additions & 15 deletions
