Announcement_35
A paper “FLYING SERVING: On-the-Fly Parallelism Switching for Large Language Model Serving” is accepted in ACM International Conference on Supercomputing (ICS’26).
Enjoy Reading This Article?
Here are some more articles you might like to read next: