-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
DOC improve diabetes description #16534
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you very much, this is very useful.
Where did you find the expanded names of each variable, e.g. that "tc" stand for "a number of White Blood Cells"?
The website of the data does not give those details in the summary page:
https://www4.stat.ncsu.edu/~boos/var.select/diabetes.html
Did you fine a better reference? If so please mention the extra source at the end of the section.
No, I just searched for the description of those abbreviation in the standard blood test (so no specific reference). If you would prefer I could make it more detailed, eg: |
I'd probably replace the "white blood cells" to "T-Cells (a type of white blood cells)" |
…into diabetes_description
Thanks @maikia ! |
Reference Issues/PRs
What does this implement/fix? Explain your changes.
It has been suggested in #16155 that the features of the Diabetes dataset were not well described. Not only the names differed slightly in the explanation and in the dataset but also they were not informative: eg 's1'
This updates the description to give more meaning to the features
Any other comments?