Stream notes
Video
Here’s the VOD:
Below is an embed
Summary
- Intro
- How was your weekend?
- Pokemon Go
- Dratini Community Day Classic
- Gimmighoul reveal
- Took a nice drive
- Tidied up BuenosDS, and published a lot of notes
- Coding
- Tried to finish migration from Heroku -> Render -> Digital Ocean
- Working on more feature agglomeration
- Possibly DBSCAN or HDBSCAN?
- Hyperparameter selection in Hierarchical Agglomerative Clustering
- Dimension reduction with DBSCAN
- Basically, we need to do dimension reduction regardless
- Let’s try (for both KMeans and DBSCAN)
- [ ] FeatureAgglomeration
- [ ] tSVD (again)
- [ ] NMF
- Let’s install Optuna!
- Go through the tutorials and stuff, maybe we can work on this this week
- DBCV for performance review instead of the elbow method or silhouette analysis
- Installed via poetry+git!
- From chat/derail
- stream_kyle’s birthday tomorrow!
- Crono kinda wants melatonin
- From texoport
- MeaLeon redeem for cochinita pibil (Mexican) because david_delaune asked what we’re working on
- dave_delaune knows someone who has a patent that reduced C/C++ code to AST to find plagiarism (sounds really cool)
- Charles Moyes (Microsoft -> NVIDIA -> Meta)
- We raided EroAxee
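
A rough sketch of the "let’s try" comparison from the coding notes above: run each dimension reducer (FeatureAgglomeration, tSVD, NMF), then cluster the reduced data with both KMeans and DBSCAN. Everything specific here is an assumption for illustration: the toy matrix `X` stands in for the real recipe features, and `n_components=5`, `n_clusters=3`, `eps=0.5`, and `min_samples=5` are placeholder values, not tuned ones. The score is scikit-learn's silhouette score, used only as a stand-in because the DBCV package installed on stream isn't named in these notes.

```python
import numpy as np
from sklearn.cluster import DBSCAN, FeatureAgglomeration, KMeans
from sklearn.decomposition import NMF, TruncatedSVD
from sklearn.metrics import silhouette_score

# Hypothetical stand-in for the recipe term matrix:
# 3 groups of non-negative rows, each group "using" its own 10 terms.
rng = np.random.default_rng(0)
n_per_group, n_features = 40, 30
blocks = []
for g in range(3):
    block = rng.random((n_per_group, n_features)) * 0.1
    block[:, g * 10:(g + 1) * 10] += 1.0
    blocks.append(block)
X = np.vstack(blocks)

# The three reducers from the checklist (placeholder sizes).
reducers = {
    "FeatureAgglomeration": FeatureAgglomeration(n_clusters=5),
    "tSVD": TruncatedSVD(n_components=5, random_state=0),
    "NMF": NMF(n_components=5, init="nndsvda", max_iter=500, random_state=0),
}

# Factories so every reducer gets a fresh, unfitted clusterer.
clusterers = {
    "KMeans": lambda: KMeans(n_clusters=3, n_init=10, random_state=0),
    "DBSCAN": lambda: DBSCAN(eps=0.5, min_samples=5),
}

results = {}
for r_name, reducer in reducers.items():
    X_red = reducer.fit_transform(X)
    for c_name, make_clusterer in clusterers.items():
        labels = make_clusterer().fit_predict(X_red)
        mask = labels != -1  # DBSCAN marks noise points with -1
        n_found = len(set(labels[mask]))
        # Silhouette is only defined for 2 or more clusters.
        score = None
        if n_found >= 2:
            score = silhouette_score(X_red[mask], labels[mask])
        results[(r_name, c_name)] = (n_found, score)

for (r_name, c_name), (n_found, score) in results.items():
    print(r_name, c_name, n_found, score)
```

Dropping DBSCAN's noise points before scoring flatters DBSCAN a bit, which is one reason a density-aware metric like DBCV is the better fit once it's wired in.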
Shoutouts
Streamers who were active in chat
To Do
Code
- [ ] Add custom function to look at the term frequency distributions
- [ ] This is even what the scikit-learn docs do…they should just build it into the algorithms…
- [ ] What is a good minimum document frequency for a term and what is a cutoff? 1/number of topics?
- [ ] Look at the example on sklearn for NMF and topic extraction
- [ ] add ingredient information to search result box
- [ ] Make the background tile
- [ ] take another look at streamlit
- [ ] Look up math theory behind t-SNE
- [ ] optuna for automated hyperparameter searching
- [ ] Should .lock and tool-versions be added to .gitignore? I’ve never committed them
- [ ] Refactor to use Pola.rs
- [ ] Write better/more thorough docstrings
- [ ] Look up examples of applying OOP practices to data science
- [ ] Make a PR to update the documentation for how **kwargs are used for sklearn Pipeline (their example and docs are seemingly incorrect)
- [ ] Make a PR to update the FeatureAgglomeration docs to have linkage ward also say "only euclidean is accepted"
- [ ] Make a PR to update FeatureAgglomeration docs: cosine affinity cannot work with sparse matrices
- [ ] Figure out the ideal process for putting a corpus (a collection of documents) through a sklearn pipeline, given that we previously got the overall counts and then did tfidf on the individual recipes using the CV from the overall corpus.
Photo
- [ ] Try to see if the preview pane in Bridge can get shifted left a little
General Stream/Admin
- [ ] How to change wordpress endpoint, specifically /admin
- [ ] Make hand cam a separate scene?
- [ ] Make subgoal buying a wet fart soundboard (thanks chat)
- [ ] CornoZeewo scuffed model
- [ ] Corno mode
- [ ] CuwonoZeewo scuffed mode
- [ ] Need a Klawful knife emote, or a shotgun (thanks R4D4R)
- [ ] Install minecraft
- [ ] Update suggested python dev list
- [ ] Add statquest channel
- [ ] 2022.10.14 First ever follow bot attack
- [ ] Expensive redeem: crono tells lore from vtuber chat
- [ ] Add smort command
- [ ] Maweexy wants "Data Da Da", maybe that can be a different mode
- [ ] Intelijens +1
- [ ] !Keyboard command
- [X] Really need to set up a timer for breaks/ads
- [ ] Or just have fewer ads, Crono
- [ ] Add contributing guidelines to repo
- [ ] Use KMeans to predict cluster for the Missing Cuisine recipes
- [ ] Move to rawer data (see step below)
- [ ] Run clustering on the untransformed dataframes to see what we get out
- [ ] Update README
- [ ] Check speed limiter on Edamam, it may be too restrictive
- See if there’s an error code to diagnose
- For next stream
- Classify and/or cluster the missing labels to see how they can augment existing recipe database
- [ ] 2022.10.31 Raided 4 times!
- [X] Migrate to DigitalOcean already
- Flask app working in Dev!
- Would like to make space on BuenosDS.dev/MeaLeon for it to run
- [ ] Add button to copy links in markdown format in MeaLeon