SPA: Towards a Computational Friendly Cloud-Base and on-Devices Collaboration Seq2seq Personalized Generation with Causal InferenceEasyChair Preprint 15343, version historyKeyphrases: Cloud-device Collaboration, Personalized LLM, inference acceleration |