[-] behohippy@lemmy.world 2 points 1 year ago

The advancements in this space have moved so fast that it's hard to build a predictive model of where we'll end up and how fast we'll get there.

Meta releasing LLaMA produced a ton of open-source innovation that showed you could run models close to ChatGPT's level with fewer parameters, on smaller and smaller hardware. At the same time, almost every large company you can think of has made integrating generative AI a top strategic priority, with blank-cheque budgets. Whole industries (also deeply funded) are popping up around fixing the context-window memory limits, prompt stuffing for better steerability, and better summarization and embedding of your personal or corporate data.

We're going to see LLM tech everywhere in everything, even if it makes no sense and becomes annoying. After a few years, maybe it'll seem normal to have a conversation with your shoes?

[-] behohippy@lemmy.world 2 points 1 year ago

I'm not sure either; Win 10/11 are pretty quick to get going, and Ubuntu isn't much slower. If I have to hard-reset the MBP for work, it's a nice block of slacker time :)

[-] behohippy@lemmy.world 2 points 1 year ago

Halls of Torment. A $5 game on Steam that's basically a Vampire Survivors clone, but with more RPG elements to it.

[-] behohippy@lemmy.world 4 points 1 year ago

These are amazing. Dell, Lenovo, and I think HP made these tiny things, and they were so much easier to get than Pis during the shortage. Plus they're incredibly fast in comparison.

[-] behohippy@lemmy.world 4 points 1 year ago

I've got a background in deep learning and I still struggle to understand the attention mechanism. I know it's a key/value store, but I'm not sure what it's doing to the tensor as it passes through different layers.
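
As far as I can tell, the core computation boils down to something like this minimal numpy sketch (single head, no learned projections, masking, or multi-head split): each token's output is just a weighted mix of the value vectors, with the weights coming from query/key similarity, so the tensor keeps its shape through the layer.

```python
import numpy as np

def attention(Q, K, V):
    """Single-head scaled dot-product attention: each query does a
    soft lookup against the keys and returns a weighted mix of values."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                    # (seq, seq) query/key similarity
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)     # softmax over the keys
    return weights @ V                                 # same shape as Q

# Toy example: 4 tokens with an 8-dimensional head
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
# A real layer would compute Q, K, V from learned projections of x; identity here for brevity
out = attention(x, x, x)
print(out.shape)  # (4, 8) -- the shape is unchanged, only the per-token mixing differs by layer
```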

[-] behohippy@lemmy.world 4 points 1 year ago

Subscribed. That last episode of AAA was heartbreaking.

[-] behohippy@lemmy.world 2 points 1 year ago

I'm on lemmy.world and the sidebar shows 401 subscribers. Is that just a sub count from the local instance or global?

[-] behohippy@lemmy.world 4 points 1 year ago

Also, not sure how that would be helpful. If every prompt needs to rip through all those tokens before predicting a response, it'll be stupid slow. Even now with llama.cpp, it's annoying when it pauses to do the context-window shuffle thing.
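
Rough illustration with llama-cpp-python (the model path and sizes are just placeholders, not a recommendation): every prompt token has to be evaluated before the first response token can be sampled, so time-to-first-token grows with however much you stuff into the context.

```python
import time
from llama_cpp import Llama  # pip install llama-cpp-python

# Placeholder model path and context size -- swap in whatever GGUF model you actually run
llm = Llama(model_path="models/llama-2-7b.Q4_K_M.gguf", n_ctx=4096, verbose=False)

def completion_time(prompt: str) -> float:
    """Time a short completion; with long prompts, prompt evaluation dominates."""
    start = time.time()
    llm(prompt, max_tokens=16)
    return time.time() - start

short_prompt = "Summarize: the cat sat on the mat."
stuffed_prompt = "Summarize: " + ("the cat sat on the mat. " * 400)  # a few thousand tokens

print(f"short prompt:   {completion_time(short_prompt):.1f}s")
print(f"stuffed prompt: {completion_time(stuffed_prompt):.1f}s")  # the pause before any output
```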

0

I host a ton of services behind my nginx reverse proxy (basic auth + Let's Encrypt). On the whole it works really well with nearly everything I throw at it. Lately, there's been a lot of gradio/WebSocket/Python stuff coming from the AI community, like the local LLaMA and Stable Diffusion apps. Not sure what's causing it, but there are always weird issues when I try to reverse proxy them.

Does anyone have some magic settings that "just work" with these weirdo web apps?
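
For reference, the kind of location block that usually comes up for these apps looks roughly like this (the port is just a placeholder for wherever the gradio app happens to listen); the WebSocket upgrade headers and turning off buffering seem to be the usual sticking points, but I'm not certain this covers everything:

```nginx
# Hypothetical example: gradio-style app on 127.0.0.1:7860 behind nginx.
# HTTP/1.1 plus the Upgrade/Connection headers let WebSockets survive the proxy;
# long timeouts and no buffering keep streaming responses from being cut off.
location / {
    proxy_pass http://127.0.0.1:7860;
    proxy_http_version 1.1;
    proxy_set_header Upgrade $http_upgrade;
    proxy_set_header Connection "upgrade";
    proxy_set_header Host $host;
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    proxy_set_header X-Forwarded-Proto $scheme;
    proxy_read_timeout 300s;
    proxy_buffering off;
}
```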

4
Happy Barkday (lemmy.world)
submitted 1 year ago by behohippy@lemmy.world to c/aww@lemmy.ml

He's 5 today

[-] behohippy@lemmy.world 2 points 1 year ago

Are you running your own mail server? I only ever integrated SpamAssassin with Postfix.

[-] behohippy@lemmy.world 2 points 1 year ago

Stable Diffusion (Stability AI version), text-generation-webui (WizardLM), a text embedding service with spaCy, BERT, and a bunch of sentence-transformers models, Pi-hole, OctoPrint, Elasticsearch/Kibana for my IoT stuff, Jellyfin, Sonarr, FTB Minecraft (customized pack), a few personal apps I wrote myself (todo lists), SMB file shares, and qBittorrent and Transmission (one dedicated to Sonarr)... Probably a ton of other stuff I'm forgetting.

[-] behohippy@lemmy.world 2 points 1 year ago

Yup, typically we get into it after upgrading an older PC or something, and instead of selling the parts, we just turn it into a server. You can also find all sorts of cheap, good stuff on eBay from off-lease office machines.

[-] behohippy@lemmy.world 2 points 1 year ago* (last edited 1 year ago)

Wow, a reply I made in another community ended up under this one. Yeah, I'm doing a lot of work on local models and text embedding models for vector search.
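
If anyone's curious, the vector search side is conceptually pretty simple. A minimal sketch with sentence-transformers (the model name is just a small, common default, not necessarily what I run):

```python
import numpy as np
from sentence_transformers import SentenceTransformer  # pip install sentence-transformers

# Any sentence-transformers checkpoint works; this is just a small, common default
model = SentenceTransformer("all-MiniLM-L6-v2")

docs = [
    "How to configure nginx as a reverse proxy",
    "Vampire Survivors is a bullet-heaven roguelite",
    "Running LLaMA-style models on consumer GPUs",
]

def embed(texts):
    """Encode texts and L2-normalize so a dot product equals cosine similarity."""
    vecs = np.asarray(model.encode(texts))
    return vecs / np.linalg.norm(vecs, axis=1, keepdims=True)

doc_vecs = embed(docs)
query_vec = embed(["hosting large language models locally"])[0]

scores = doc_vecs @ query_vec        # cosine similarity against every document
print(docs[int(np.argmax(scores))])  # nearest document by embedding
```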

59

Ryzen 5900X, 64 GB DDR4-3200, 2 TB SSD, 10 TB HDD, and an RTX 2070. Hosting Stable Diffusion, various llama.cpp instances with Python bindings, Jellyfin, Sonarr, multiple modded Minecraft servers, and a network file share.

14
Attitude Dog (lemmy.world)
submitted 1 year ago by behohippy@lemmy.world to c/aww@lemmy.ml

She's mostly good. Mostly.
