Download PDFOpen PDF in browser

Building UniGPT: A Customizable On-Premise LLM-Solution for Universities

9 pagesPublished: January 13, 2025

Abstract

Large Language Models (LLMs) have become the hot topic in Artificial Intelligence (AI) in the last few years, especially with the advent of the Generative Pretrained Transformer (GPT) models and the release of ChatGPT in November 2022. Demand from faculty members for access to such models quickly arose and proved to be hard to address in an orderly manner by central IT providers due to technical, privacy and payment constraints by the major suppliers. Additionally, specific research questions might require more control over the model and deployment. Thus, access to non-public on-premises models became desirable and became possible with open-source solutions (e.g. Llama 2 by Meta and Mixtral 8x7B by Mistral AI). Due to the large model sizes, on-premises deployments are demanding in terms of hardware and system engineering. In this paper, we present our deployment of a large language model for the University of Mu ̈nster, a service we call UniGPT. We focus on the high-level architecture, consisting of the frontend, backend, and models, and also discuss the experiences with our deployed service.

Keyphrases: artificial intelligence ai, large language model llm, on premises service deployment

In: Raimund Vogl, Laurence Desnos, Jean-François Desnos, Spiros Bolis, Lazaros Merakos, Gill Ferrell, Effie Tsili and Manos Roumeliotis (editors). Proceedings of EUNIS 2024 annual congress in Athens, vol 105, pages 108-116.

BibTeX entry
@inproceedings{EUNIS2024:Building_UniGPT_Customizable_Premise,
  author    = {Jonathan Radas and Benjamin Risse and Raimund Vogl},
  title     = {Building UniGPT: A Customizable On-Premise LLM-Solution for Universities},
  booktitle = {Proceedings of EUNIS 2024 annual congress in Athens},
  editor    = {Raimund Vogl and Laurence Desnos and Jean-François Desnos and Spiros Bolis and Lazaros Merakos and Gill Ferrell and Effie Tsili and Manos Roumeliotis},
  series    = {EPiC Series in Computing},
  volume    = {105},
  publisher = {EasyChair},
  bibsource = {EasyChair, https://easychair.org},
  issn      = {2398-7340},
  url       = {/publications/paper/CDHx},
  doi       = {10.29007/jv1l},
  pages     = {108-116},
  year      = {2025}}
Download PDFOpen PDF in browser