We want to annotate dolci data with propella annotations for dataset selection and other purposes.
First, we want to check qualitatively whether the annotations make sense for instruction data, since propella was validated on raw pretraining data.
Then, we want to estimate the compute required before committing to the full annotation run.
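
One simple way to get a compute estimate is to time a small pilot annotation run and extrapolate linearly to the full dataset. The sketch below is only an illustration of that arithmetic: `PilotRun`, `estimate_gpu_hours`, and all numbers in the usage lines are hypothetical placeholders, not measurements from propella or dolci, and it assumes cost scales linearly with the number of samples.

```python
from dataclasses import dataclass


@dataclass
class PilotRun:
    """Timing of a small pilot annotation run (hypothetical structure)."""
    num_samples: int           # samples annotated in the pilot
    wall_clock_seconds: float  # elapsed time of the pilot
    num_gpus: int              # GPUs used during the pilot


def estimate_gpu_hours(pilot: PilotRun, total_samples: int) -> float:
    """Linearly extrapolate pilot cost to the full dataset, in GPU-hours."""
    if pilot.num_samples <= 0:
        raise ValueError("pilot must contain at least one sample")
    # GPU-seconds spent per annotated sample in the pilot.
    gpu_seconds_per_sample = (
        pilot.wall_clock_seconds * pilot.num_gpus / pilot.num_samples
    )
    return gpu_seconds_per_sample * total_samples / 3600.0


# Placeholder numbers for illustration only, not real measurements.
pilot = PilotRun(num_samples=1_000, wall_clock_seconds=600.0, num_gpus=1)
print(f"{estimate_gpu_hours(pilot, total_samples=1_000_000):.1f} GPU-hours")
```

The linear assumption is likely too coarse if sample lengths vary a lot, so the pilot should be drawn to match the length distribution of the full data.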