Stream notes
Video
Here’s the VOD:
Below is an embed
Summary
- Intro
- Admin and follow up on yesterday’s to dos
- Coding
- Fitting Random Forests
- lattjorr asked a cool question: How do I compare two linear regression models? one cuts the amount of variables used by 2/3 but how to explain which is better using this Telco Customer Churn dataset
- Crono’s business answer: the model that is easier to explain to the people paying you and/or the one that captures most of the true behavior with less data
- This Quora discussion is better because there’s actual processes
- Found a discussion of F statistic that had improper language about p-value meaning and that derailed and talked about the controversy around p-values
- Related to my business focused answer above was a reddit post asking about what 20% tools people are using to do 80% of their work
- Crono’s business answer: the model that is easier to explain to the people paying you and/or the one that captures most of the true behavior with less data
- Suggested checking out the subreddits for datascience and statistics
- kmode has an exam in a super interesting subject: Uncertainty quantification
- Cat struggles to drink water from a faucet
- Want to chill out? Watch capybara
- Music
- We got raided by R4D4R_Live!
- We raided AppleGlass (they/them)
Shoutouts
Streamers who were active in chat
To Do
Code
- [ ] How to change wordpress endpoint, specifically /admin
- [ ] send Leah the math derivations for decision trees
- [ ] Add custom function to look at the term frequency distributions
- [ ] This is even what the scikit-learn docs do…they should just build it in to the algorithms…
- [ ] What is a good minimium document frequency for a term and what is a cutoff? 1/number of topics?
- [ ] Look at the example on sklearn for NMF and topic extraction
- [ ] Add tree visualization libraries to virtual environment: ETE or Graphviz (which is in sklearn now) or Plotly or treelib (which is the most simple)
- [ ] Sery bot for follow botting protection
Photo
- [ ] Try to see if the preview pane in Bridge can get shifted left a little