Thursday, 10/26/2023

Log:

We met 5:30 - 9:30 PM and attempted to train a sparse autoencoder using Logan’s code.
We ran into some obstacles loading data.
- Insight: Pythia 70M chess needs chess data!
- Insight: load small datasets onto Colab, not a billion tokens!
- Insight: just ask Logan :)
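
A minimal sketch of the "small dataset" insight, assuming the data arrives as an iterable of text strings (for example a dataset opened in streaming mode). The names `take_token_budget` and `fake_corpus` are hypothetical, and whitespace splitting is only a rough stand-in for the real tokenizer:

```python
from typing import Callable, Iterable, Iterator


def take_token_budget(
    texts: Iterable[str],
    max_tokens: int,
    count_tokens: Callable[[str], int] = lambda t: len(t.split()),
) -> Iterator[str]:
    """Yield texts in order and stop before the token budget is exceeded."""
    used = 0
    for text in texts:
        n = count_tokens(text)
        if used + n > max_tokens:
            break  # stop reading; the rest of the corpus is never touched
        used += n
        yield text


def fake_corpus() -> Iterator[str]:
    """Hypothetical stand-in for a huge corpus: an endless generator of move strings."""
    i = 0
    while True:
        yield f"e4 e5 Nf3 Nc6 game{i}"
        i += 1


# Each fake example counts as 5 whitespace tokens, so a budget of 50 keeps 10 examples.
small = list(take_token_budget(fake_corpus(), max_tokens=50))
print(len(small))
```

Because the function only pulls items as it needs them, it can be pointed at a very large source without loading all of it into Colab memory.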

Todos:

Naomi might try to get the sparse autoencoders training tonight.
- Upload to HuggingFace
- Look at the autoencoder using Neel Nanda’s library
Try to fine-tune the sparse autoencoder on a dataset heavy on one feature. This task will help us practice fine-tuning before we move on to RLHF. We’ll start on Friday.
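
For reference while practicing, here is a rough sketch of the objective a sparse autoencoder is commonly trained on: reconstruction error plus an L1 penalty on the hidden activations. This is an assumption about the general technique, not a description of Logan's code, which may use a different formulation. All names (`sae_loss`, `w_enc`, `l1_coeff`, and so on) are hypothetical, and plain Python lists are used only to keep the sketch dependency-free:

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def matvec(m: Matrix, v: Vector) -> Vector:
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


def sae_loss(
    x: Vector,
    w_enc: Matrix,
    b_enc: Vector,
    w_dec: Matrix,
    b_dec: Vector,
    l1_coeff: float,
) -> float:
    """Squared reconstruction error plus an L1 sparsity penalty for one input."""
    # Encode: hidden = ReLU(W_enc x + b_enc)
    hidden = [max(0.0, a + b) for a, b in zip(matvec(w_enc, x), b_enc)]
    # Decode: x_hat = W_dec hidden + b_dec
    x_hat = [a + b for a, b in zip(matvec(w_dec, hidden), b_dec)]
    reconstruction = sum((a - b) ** 2 for a, b in zip(x, x_hat))
    sparsity = sum(abs(h) for h in hidden)
    return reconstruction + l1_coeff * sparsity


# Tiny worked example: 2-dimensional input, 3 hidden units.
x = [1.0, -2.0]
w_enc = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
b_enc = [0.0, 0.0, 0.0]
w_dec = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
b_dec = [0.0, 0.0]
loss = sae_loss(x, w_enc, b_enc, w_dec, b_dec, l1_coeff=0.5)
print(loss)
```

Under this formulation, fine-tuning would mean continuing to minimize the same loss starting from the trained weights, with inputs drawn from the feature-heavy dataset.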