get-started: more plots, dvclive, better dataset #102
Force-pushed from 7f5f7f6 to efc67fd.
Looks promising. Maybe we could keep the old code as a comment, for educational purposes? wdyt?
Hey @daavoo, is this still a draft?
ping! @daavoo :)
ding! :)
Hei @iesahin, what do you think about:
Should I add this PR's changes as a new branch?
Force-pushed from efc67fd to f8cd3b1.
fd,
indent=4,
)
live.log_plot("precision_recall", labels, predictions)
This probably doesn't factor in https://github.com/iterative/example-repos-dev/pull/102/files#diff-c9e81175bafc05fa2504a715d85f4fb99493314d93b2ba5b81f79fcc77cd09a7L41-L45, right? That would make Studio slow... and unfortunately sklearn doesn't support dropping intermediate values yet: scikit-learn/scikit-learn#21825
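For reference, here is a minimal sketch of one way to keep a precision-recall curve small before logging it: keep only every nth point so that at most a fixed number remain. The helper name `thin_curve` and the default of 1000 points are hypothetical, and whether this matches what the linked lines did is an assumption. It is also not a DVCLive or sklearn feature; it would have to be applied to the curve points by hand.

```python
import math


def thin_curve(points, max_points=1000):
    """Return at most `max_points` items by keeping every nth point.

    `points` is any sequence, e.g. a list of (precision, recall, threshold)
    tuples. Hypothetical helper, not part of DVCLive or scikit-learn.
    """
    if max_points < 1:
        raise ValueError("max_points must be positive")
    # Step of 1 keeps everything; larger inputs get a proportionally larger step.
    step = max(1, math.ceil(len(points) / max_points))
    return list(points[::step])


if __name__ == "__main__":
    # Synthetic stand-in for a long curve; real values would come from the model.
    curve = [(i / 9999, 1 - i / 9999) for i in range(10000)]
    print(len(thin_curve(curve)))
```

The trade-off is that evenly spaced thinning can drop the exact final point of the curve; if the endpoints matter for the plot, they would need to be re-added explicitly.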
Force-pushed from 91976b8 to ef39113.
@@ -0,0 +1,42 @@
import io
This script is not used in the project; it is only used to generate the datasets from the raw 1TB SO dump.
@@ -17,7 +17,9 @@ popd

 # Requires AWS CLI and write access to `s3://dvc-public/code/get-started/`.
 mv $PACKAGE_DIR/$PACKAGE .
-aws s3 cp --acl public-read $PACKAGE s3://dvc-public/code/get-started/$PACKAGE
+#aws s3 cp --acl public-read $PACKAGE s3://dvc-public/code/get-started/$PACKAGE
it will be reverted after iterative/dataset-registry#28
@@ -60,7 +60,7 @@ git tag -a "1-dvc-init" -m "DVC initialized."

 mkdir data
 dvc get https://github.com/iterative/dataset-registry \
-get-started/data.xml -o data/data.xml
+get-started/data.xml -o data/data.xml --rev 95d720c467496ea6c15dd2c5d5ad48bbb631d8b1
will be reverted after iterative/dataset-registry#28
@@ -79,15 +79,16 @@ dvc push

 rm data/data.xml data/data.xml.dvc
 dvc import https://github.com/iterative/dataset-registry \
-get-started/data.xml -o data/data.xml \
+get-started/data.xml -o data/data.xml --rev 95d720c467496ea6c15dd2c5d5ad48bbb631d8b1 \
will be reverted after iterative/dataset-registry#28
Force-pushed from ef39113 to 95ce158.
I've force-pushed multiple times in this repo. To make the branch name and description less confusing, I'm closing this in favor of #114.
EDITs by @shcheklein:
Corresponding repo to play with: https://studio.iterative.ai/user/shcheklein/views/example-get-started-r7898c5e32
(a few lines in the PR will be uncommented / reverted before actual merge, after testing)
It changes the get-started example:
TODO:
log_image and log_plot. dvclive#189