Self-hosting LLMs

GreenSofaBed@lemmy.zip · 17 hours ago

Self-hosting LLMs

Showroom7561@lemmy.ca · edit-2 15 hours ago

You can run this right from Windows: https://jan.ai/

You’ll need a lot of RAM, and processing is decently fast, even on a basic laptop.

edit: holy hell. Grammar.

dangling_cat@lemmy.blahaj.zone · 14 hours ago

Tip: you can copy and paste the Hugging Face link directly into the search box, and it will download the model automatically! Also, it’s pretty smart. It will load into your VRAM first, then your RAM. If you can fit everything into VRAM, you get the fastest speed. But even if you are using RAM, it’s not terribly bad; it’s still faster than you can read.

GreenSofaBed@lemmy.zip · 12 hours ago

This is pretty cool!