Stream notes
Video
Here’s the VOD:
Below is an embed
Summary
- Intro
- How was your day?
- Work
- Got Pokemon Scarlet
- How was your day?
- Coding
- HDBSCAN – Hierarchical Density-Based Spatial Clustering of Applications with Noise
- Get predictions/labels out
- Plot a tree visualization
- From chat/derail
- Pandapoopums says to try hair oil!
- Also get a 2nd set of sheets, like an adult
- Name a pokemon after Pandapoopums!
- A pandalike one or wooper (woopums)
- New additional video capture card, less expensive than Elgato
- Music
- We raided AriaADM
Shoutouts
Streamers who were active in chat
To Do
Code
- [ ] Add custom function to look at the term frequency distributions
- [ ] This is even what the scikit-learn docs do…they should just build it in to the algorithms…
- [ ] What is a good minimium document frequency for a term and what is a cutoff? 1/number of topics?
- [ ] Look at the example on sklearn for NMF and topic extraction
- [ ] add ingredient information to search result box
- [ ] Make the background tile
- [ ] take another look at streamlit
- [ ] Look up math theory behind t-SNE
- [ ] optuna for automated hyperparameter searching
- [ ] Should .lock and tool-versions be added to .gitignore? I’ve never committed them
- [ ] Refactor to use Pola.rs
- [-] Write better/more thorough docstrings
- [ ] Look up examples of applying OOP practices to data science
- [ ] Make a PR to update the documentation for how **kwargs are used for sklearn Pipeline (their example and docs are seemingly incorrect)
- [ ] Make a PR to update the FeatureAgglomeration docs to have linkage ward also say "only euclidean is accepted"
- [ ] Make a PR to update FeatureAgglomeration docs: cosine affinity cannot work with sparse matrices
- [ ] Add contributing guidelines to repo
- [ ] Use KMeans to predict cluster for the Missing Cuisine recipes
- [ ] Move to rawer data (see step below)
- [ ] Run clustering on the untransformed dataframes to see what we get out
- [] Update README
- [] Check speed limiter on Edamam, it may be too restrictive
- See if there’s an error code to diagnose
- [] Would be cool to have something like this graph
- Have interactivity to display the closest recipes to your searched recipe
- [] Link VSCode to the droplet (can use SSH)
- [] How to (check vod from 11.11)
- [] Add button to copy links in markdown format in MeaLeon
- [] Reenable TFIDF instead of the OHE vectors
Photo
- [ ] Try to see if the preview pane in Bridge can get shifted left a little
General Stream/Admin
-
[ ] How to change wordpress endpoint, specifically /admin
-
[ ] Make hand cam a separate scene?
-
[ ] Make subgoal buying a wet fart soundboard (thanks chat)
– [ ] CornoZeewo scuffed model -
[ ] Corno mode
-
[ ] CuwonoZeewo scuffed mode
-
[ ] Need a Klawful knife emote, or a shotgun (thanks R4D4R)
-
[ ] Install minecraft
-
[ ] Update suggested python dev list
- [ ] Add statquest channel
-
[ ] Expensive redeem: crono tells lore from vtuber chat (Requested by maweexy)
-
[ ] Add smort command
-
[ ] Maweexy wants "Data Da Da", maybe that can be a different mode
- [ ] Intelijens +1
-
[X] Really need to set up a timer for breaks/ads
- [ ] Or just have fewer ads, Crono
-
[ ] Fix me with small screen scene
-
[X] Add a raid command that shouts out raiders and links to an intro/MeaLeon
-
check out deadmau5 one day
- LTT x deadmau5
-
What were the results of the pun poll?
-
[] Make a command to link directly to dataset uploaded to Kaggle
-
Get Hoppip and Bulbasaur planters
-
For next stream
- Classify and or cluster the missing labels to see how they can augment existing recipe database
- Can do another dimension reduction to make a 2D visualization after attaching HDBSCAN labels to the original data
- Mix in the probability to impact "density" of color (like the HDBSCAN plot examples…but in Bokeh)
- Refactor existing template for Bokeh
Or
- See how the HDBSCAN labels "map" to cuisine labels
- Go back, reattach index labels to data so that you can compare the original recipes with the HDBSCAN labels
- Predict HDBSCAN labels for the recipes that are missing cuisines and see where they line up (noise, etc)
- But also maybe compared to cuisine labels…depending on how many "noisy" recipes there are, this might not be very productive, but see how this looks
- Could this new HDBSCAN label used instead of a hard mapped cuisine label? What would the results look like?
- Gets a little tricky if recipes map to noise
- This involves a refactor of MeaLeon’s webapp (which is overdue)
- Can still do 2D visualization to have something up
- Mix in the probability to impact "density" of color (like the HDBSCAN plot examples…but in Bokeh)