Data-Centric Lessons To Improve Speech-Language Pretraining | Flume