You can do it locally now pretty easily depending on your use ass and hardware, huggingface has all the models you'd need and use something like llama-swap