Disaggregation
Efficiently Serving Large Multimodal Models Using EPD Disaggregation
ElasticMM: Efficient Multimodal LLMs Serving with Elastic Multimodal Parallelism