KNN: s3_input with Pipe/CSV/GZIP not working #2361
Unanswered
moebelde-rs
asked this question in
Help
Replies: 1 comment
-
Thanks for using SageMaker! I've reached out to the relevant team to see if they have any insight. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I got some strange result when trying to train the KNN estimator with a gzipped CSV (what should be possible according to the docs).
This is my input setup:
With this input the KNN only processes a fraction of samples. If the CSV contains 1k samples and sample_size is also set to this value the output is:
Here you see that only 32 samples were processed... the number also changes each time.
If I use unzipped CSVs the training works as expected...
Thanks in advance
Rob
Beta Was this translation helpful? Give feedback.
All reactions