๐ ModServe - Modality- and Stage-Aware Resource Disaggregation for Scalable Multimodal Model Serving
๐ Breaking the Wall: Unifying Edge GPUs and NPUs into Pipeline Parallelism for Efficient LLM Fine-Tuning