this post was submitted on 04 Sep 2023
327 points (96.1% liked)
Linux
48209 readers
1589 users here now
From Wikipedia, the free encyclopedia
Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).
Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word "Linux" in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.
Rules
- Posts must be relevant to operating systems running the Linux kernel. GNU/Linux or otherwise.
- No misinformation
- No NSFW content
- No hate speech, bigotry, etc
Related Communities
Community icon by Alpár-Etele Méder, licensed under CC BY 3.0
founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Compile llama.cpp, download a small GGML LLM model and you will have a quite intelligent assiatant running into your phone.
Would that actually be decent? Even 6b models feel way too rudimentary after experiencing 33+b models and/or chatgpt. I haven't tried those really scaled down and optimized models, though!
They're decent for text completion purposes, e.g. generating some corpspeak for an email, or generating some "wikipedia"-like text. You have to know how to write good prompts, don't try to treat it like ChatGPT.
For example if i want to know about the history of Puerto Rico I would put:
"The history of puerto rico starts in about 480BC when"
Decent enough for a model 50 times smaller than ChatGPT. I use orca_mini_3b.
I got llama to compile but it crashes every time I tried running it.
Do we have any llama apk/deb/flatpack/appimage where you just choose the llm to use?
Idk if GPT4All is what you want.
Do we have apt install gtp4qll?
https://gpt4all.io/index.html
Says ubuntu, works fine on my non-apt distro.
lamacpp is kind of this. you give it a path to a model and a prompt. download a smallish model GGUF file by thebloke. lamacpp main exec has an interactive mode too which can function like a chatbot. i never built it on android though maybe you need android NDK