Self hosted LLM

posted 7 months ago

HumanPerson@sh.itjust.works

Hello internet users. I have tried gpt4all and like it, but it is very slow on my laptop. I was wondering if anyone here knows of any solutions I could run on my server (debian 12, amd cpu, intel a380 gpu) through a web interface. Has anyone found any good way to do this?

Sort:

Hot Top Controversial New Old

[ - ]

h3ndrik@feddit.de

23 points

7 months ago

kobold.cpp is easy to use, fast and I like it.

If you’re interested in more relevant Lemmy communities:

(another option: text-generation-webui has several backends bundled. Maybe one of those works for you.)

permalink

report

[ - ]

Scrubbles@poptalk.scrubbles.tech

8 points

7 months ago

text-generation-webui is kind of the standard from what I’ve seen to run it with a webui, but the vram stuff here is accurate. Text LLMs require an insane amount of vram to keep a conversation going.

permalink

report

[ - ]

johntash@eviltoast.org

7 points

7 months ago

Ollama and localai can both be run on a server with no gpu. You’d need to point a different web ui to them if you want though

permalink

report

[ - ]

Morethanevil@lemmy.fedifriends.social

6 points

7 months ago

There is an easy way with OpenWebUI but LLM are mostly accelerated by CUDA or ROCm. CPU acceleration is slow, but you can try it

permalink

report

[ - ]

slacktoid@lemmy.ml

5 points

7 months ago

Ollama is a nice server base, they lots of projects that plug on top of that.

permalink

report

Selfhosted

!selfhosted@lemmy.world

Create post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.
No spam posting.
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.
Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

Community stats

4.7K
Monthly active users
3.5K
Posts
78K
Comments

Community stats

Community moderators