Description
Hey Folks!
I thought it might be a bit easier if we had one issue that had the current "state of the world".
It would have a list of all completed Estimators/Functions and next to each it would have a person's name if someone was working on it or it'd be checked if it was complete and merged in dev.
Ping me in the comments beneath and I'll add you to whichever estimators you want to work on.
I went through the scikit-learn docs yesterday and broke out the Estimators that we would need for an MVP of scikit.js (let's call it version 1).
Version 1
The focus here is on simple models, and all the preprocessing, and metrics that you'd need to perform high quality model generation.
linear_model
- LinearRegression
- LassoRegression
- RidgeRegression
- ElasticNet
- LogisticRegression
- SGDClassifier
- SGDRegressor
cluster
- KMeans
neighbors
dummy
- DummyClassifier
- DummyRegressor
impute
- SimpleImputer
preprocessing
- StandardScaler
- MinMaxScaler
- MaxAbsScaler
- Normalizer
- RobustScaler
- LabelEncoder
- OneHotEncoder
- OrdinalEncoder
pipeline
- Pipeline
compose
- ColumnTransformer
tree
metrics
- accuracyScore
- confusionMatrix
- hingeLoss
- logLoss
- precisionScore
- recallScore
- rocAucScore
- zeroOneLoss
- meanAbsoluteError
- meanSquaredError
- meanSquaredLogError
- r2Score
So pick whichever ya want, and ping me, and I'll update the issue and put your name next to the Estimator / Function.
Some great resources for contributors
- ML from scratch in Python : https://github.com/eriklindernoren/ML-From-Scratch
- Nick Leclure's Book: https://github.com/nfmcclure/tensorflow_cookbook
- Charlie Gerard's Book: Practical Machine Learning with Tensorflow.js
- MachineLearnjs : https://github.com/machinelearnjs/machinelearnjs
Hello folks! Time flies when you're having fun :)
We are rounding the corner the completion of the MVP / Version 1 list above. I thought it would be good to go through scikit-learn and make a list of the next most important things. That list is below as well as some general todos (docs, tutorials). Feel free to ping me or comment below and grab whatever interests in the following list.
Onward and Upward!
linear_model
- Exact solution for linear_regression
datasets
- Iris
- Boston Housing
- Put Common datasets in scikitjs #44
-
make_classification
/make_regression
from scikit-learn #50
naive_bayes
svm
- LinearSVC
- LinearSVR
- SVC
- SVR
model_selection
- GroupKFold
- Implement KFold #46
- ShuffleSplit
- train_test_split #45
decomposition
- PCA
hyper_parameter
ensemble
- VotingRegressor
- VotingClassifier
- RandomForestClassifier
- RandomForestRegressor
docs
- Make Basic Docs site
- Push the Basic Docs site to scikit.org. Have scikit.js redirect to scikit.org
- Make Basic Docs site show api for all functions / classes that we export
- Make it build browser and node versions
- Make the tests run against browser and node environments