Hosting Multiple LLMs on a Single Endpoint - AWS SageMaker

Utilize SageMaker Inference Components to Host Flan & Falcon in a Cost & Performance Efficient Manner

Link