aws · zdwolfe · Nov 27, 2017 · Nov 27, 2017
diff --git a/...plying_machine_learning/xgboost_direct_marketing/xgboost_direct_marketing_sagemaker.ipynb b/...plying_machine_learning/xgboost_direct_marketing/xgboost_direct_marketing_sagemaker.ipynb
@@ -286,7 +286,7 @@
     "\n",
     "* Handling missing values: Some machine learning algorithms are capable of handling missing values, but most would rather not.  Options include:\n",
     " * Removing observations with missing values: This works well if only a very small fraction of observations have incomplete information.\n",
-    " * Remove features with missing values: This works well if there are a small number of features which have a large number of missing values.\n",
+    " * Removing features with missing values: This works well if there are a small number of features which have a large number of missing values.\n",
     " * Imputing missing values: Entire [books](https://www.amazon.com/Flexible-Imputation-Missing-Interdisciplinary-Statistics/dp/1439868247) have been written on this topic, but common choices are replacing the missing value with the mode or mean of that column's non-missing values.\n",
     "* Converting categorical to numeric: The most common method is one hot encoding, which for each feature maps every distinct value of that column to its own feature which takes a value of 1 when the categorical feature is equal to that value, and 0 otherwise.\n",
     "* Oddly distributed data: Although for non-linear models like Gradient Boosted Trees, this has very limited implications, parametric models like regression can produce wildly inaccurate estimates when fed highly skewed data.  In some cases, simply taking the natural log of the features is sufficient to produce more normally distributed data.  In others, bucketing values into discrete ranges is helpful.  These buckets can then be treated as categorical variables and included in the model when one hot encoded.\n",
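Note on the surrounding notebook text: the cleaning options listed in this cell (mean imputation, one hot encoding, log transforms, and bucketing) can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the notebook's own code. The toy records, the column names (`age`, `job`, `balance`), the bucket edges, and every helper function name below are hypothetical and chosen purely for demonstration.

```python
import bisect
import math
from statistics import mean

# Hypothetical toy records (not from the notebook's dataset); None marks a missing value.
rows = [
    {"age": 30, "job": "admin", "balance": 100.0},
    {"age": None, "job": "technician", "balance": 10000.0},
    {"age": 50, "job": "admin", "balance": 1000.0},
]


def impute_mean(rows, column):
    """Replace missing values in `column` with the mean of its non-missing values."""
    observed = [r[column] for r in rows if r[column] is not None]
    fill = mean(observed)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]


def one_hot(rows, column):
    """Replace `column` with one 0/1 indicator feature per distinct value."""
    values = sorted({r[column] for r in rows})
    encoded = []
    for r in rows:
        new_row = {k: v for k, v in r.items() if k != column}
        for value in values:
            new_row[f"{column}_{value}"] = 1 if r[column] == value else 0
        encoded.append(new_row)
    return encoded


def log_transform(rows, column):
    """Take the natural log of a strictly positive, skewed column."""
    return [{**r, column: math.log(r[column])} for r in rows]


def bucket(rows, column, edges):
    """Map each value to a discrete bucket label using sorted `edges` as cut points."""
    return [
        {**r, column: f"bucket_{bisect.bisect_right(edges, r[column])}"}
        for r in rows
    ]


imputed = impute_mean(rows, "age")                # missing age becomes the column mean
encoded = one_hot(imputed, "job")                 # job -> job_admin, job_technician
logged = log_transform(encoded, "balance")        # compress the skewed balance column
bucketed = bucket(rows, "balance", [500.0, 5000.0])
bucketed_encoded = one_hot(bucketed, "balance")   # buckets treated as categorical
```

In practice a dataframe library would usually handle these steps. The plain-Python version is used here only so the mechanics of each option are visible.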