
You can try SkyPilot: https://skypilot.readthedocs.io/en/latest/

It handles storage, setup, etc. for machine learning workloads across several providers, which helps a lot if you need an instance type that rarely has capacity, like 8x A100 pods.
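
To give a rough idea, a task file might look something like the sketch below. The bucket name, mount path, and script names are made up for illustration, so check the docs for the exact fields your version supports:

```yaml
# task.yaml: hypothetical example, names and paths are placeholders
resources:
  accelerators: A100:8        # ask for 8x A100; no cloud is pinned here

workdir: .                    # local directory synced to the instance

file_mounts:
  /data:
    source: s3://my-bucket    # placeholder bucket name
    mode: MOUNT

setup: |
  pip install -r requirements.txt

run: |
  python train.py --data /data
```

You would then start it with something like `sky launch -c mycluster task.yaml`. Leaving the cloud unspecified is what lets it look for capacity across the providers you have credentials for.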


