this post was submitted on 24 Jul 2024

Llama 3.1 Megathread (lemmy.world)

submitted 3 months ago by Blaed@lemmy.world to c/fosai@lemmy.world

Meta has released and open-sourced Llama 3.1 in three different sizes: 8B, 70B, and 405B

This new Llama iteration brings state-of-the-art performance to the open-source ecosystem.

If you've had a chance to use Llama 3.1 in any of its variants - let us know how you like it and what you're using it for in the comments below!

Llama 3.1 Megathread

For this release, we evaluated performance on over 150 benchmark datasets that span a wide range of languages. In addition, we performed extensive human evaluations that compare Llama 3.1 with competing models in real-world scenarios. Our experimental evaluation suggests that our flagship model is competitive with leading foundation models across a range of tasks, including GPT-4, GPT-4o, and Claude 3.5 Sonnet. Additionally, our smaller models are competitive with closed and open models that have a similar number of parameters.

As our largest model yet, training Llama 3.1 405B on over 15 trillion tokens was a major challenge. To enable training runs at this scale and achieve the results we have in a reasonable amount of time, we significantly optimized our full training stack and pushed our model training to over 16 thousand H100 GPUs, making the 405B the first Llama model trained at this scale.

Official Meta News & Documentation

See also: The Llama 3 Herd of Models paper here:

https://ai.meta.com/research/publications/the-llama-3-herd-of-models/

HuggingFace Download Links

`8B`

Meta-Llama-3.1-8B

https://huggingface.co/meta-llama/Meta-Llama-3.1-8B

Meta-Llama-3.1-8B-Instruct

https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct

Llama-Guard-3-8B

https://huggingface.co/meta-llama/Llama-Guard-3-8B

Llama-Guard-3-8B-INT8

https://huggingface.co/meta-llama/Llama-Guard-3-8B-INT8

`70B`

Meta-Llama-3.1-70B

https://huggingface.co/meta-llama/Meta-Llama-3.1-70B

Meta-Llama-3.1-70B-Instruct

https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct

`405B`

Meta-Llama-3.1-405B-FP8

https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-FP8

Meta-Llama-3.1-405B-Instruct-FP8

https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct-FP8

Meta-Llama-3.1-405B

https://huggingface.co/meta-llama/Meta-Llama-3.1-405B

Meta-Llama-3.1-405B-Instruct

https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct
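
The repository names listed above follow a regular pattern, so they can be built programmatically. Below is a minimal sketch, not an official tool: the helper names `llama31_repo_id`, `repo_url`, and `download` are hypothetical and only cover the Meta-Llama-3.1 repos in this list (not the Llama Guard ones). The `download` function assumes the `huggingface_hub` package is installed and that your Hugging Face account has already been granted access to the gated meta-llama repos and is logged in.

```python
HF_ORG = "meta-llama"
SIZES = {"8B", "70B", "405B"}


def llama31_repo_id(size: str, instruct: bool = False, fp8: bool = False) -> str:
    """Build a repo id matching the naming pattern in the list above."""
    if size not in SIZES:
        raise ValueError(f"unknown size {size!r}, expected one of {sorted(SIZES)}")
    if fp8 and size != "405B":
        # The list above only shows FP8 repos for the 405B model.
        raise ValueError("FP8 repos are only listed for 405B")
    name = f"Meta-Llama-3.1-{size}"
    if instruct:
        name += "-Instruct"
    if fp8:
        name += "-FP8"
    return f"{HF_ORG}/{name}"


def repo_url(repo_id: str) -> str:
    """Turn a repo id into its Hugging Face page URL."""
    return f"https://huggingface.co/{repo_id}"


def download(repo_id: str, local_dir: str) -> str:
    """Download a full repo snapshot (assumes huggingface_hub and gated access)."""
    # Imported lazily so the helpers above work without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, local_dir=local_dir)


if __name__ == "__main__":
    for size in ("8B", "70B", "405B"):
        print(repo_url(llama31_repo_id(size, instruct=True)))
```

For example, `download(llama31_repo_id("8B", instruct=True), "./llama-3.1-8b-instruct")` would fetch the 8B Instruct weights into a local folder, provided the access assumptions above hold.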

Getting the models

You can download the models directly from Meta or one of our download partners: Hugging Face or Kaggle.

Alternatively, you can work with ecosystem partners to access the models through the services they provide. This approach can be especially useful if you want to work with the Llama 3.1 405B model.

Note: Llama 3.1 405B requires significant storage and computational resources, occupying approximately 750GB of disk storage space and necessitating two nodes on MP16 for inferencing.

Learn more at:

https://llama.meta.com/docs/getting_the_models
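
To get a feel for where a figure like "approximately 750GB" comes from, here is a rough back-of-the-envelope sketch. It assumes the weights are stored at 2 bytes per parameter (16-bit) and uses the nominal parameter counts from the model names. Both are assumptions, since the note above does not state the precision, and the function name `weight_bytes` is just for illustration. It ignores tokenizer files, metadata, and any runtime memory such as the KV cache.

```python
GB = 10**9    # decimal gigabyte
GIB = 2**30   # binary gibibyte


def weight_bytes(params: float, bytes_per_param: float) -> float:
    """Approximate size of the raw weights in bytes."""
    return params * bytes_per_param


# Nominal parameter counts taken from the model names (assumption).
MODELS = {"8B": 8e9, "70B": 70e9, "405B": 405e9}

if __name__ == "__main__":
    for name, params in MODELS.items():
        size16 = weight_bytes(params, 2)  # assumed 16-bit weights
        size8 = weight_bytes(params, 1)   # assumed 8-bit weights (e.g. FP8)
        print(
            f"{name}: 16-bit ~{size16 / GB:.0f} GB ({size16 / GIB:.0f} GiB), "
            f"8-bit ~{size8 / GB:.0f} GB"
        )
```

Under these assumptions the 405B model comes out at roughly 810 GB (about 754 GiB) for 16-bit weights, which is in the same ballpark as the figure quoted above, and about half that for the FP8 variant.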

Running the models

More guides and resources

How-to Fine-tune Llama 3.1 models

https://llama.meta.com/docs/how-to-guides/fine-tuning

Quantizing Llama 3.1 models

https://llama.meta.com/docs/how-to-guides/quantization

Prompting Llama 3.1 models

https://llama.meta.com/docs/how-to-guides/prompting

Llama 3.1 recipes

https://github.com/meta-llama/llama-recipes

YouTube media

Rowan Cheung - Mark Zuckerberg on Llama 3.1, Open Source, AI Agents, Safety, and more

https://www.youtube.com/watch?v=Vy3OkbtUa5k

Matthew Berman - BREAKING: LLaMA 405b is here! Open-source is now FRONTIER!

https://www.youtube.com/watch?v=JLEDwO7JEK4

Wes Roth - Zuckerberg goes SCORCHED EARTH.... Llama 3.1 BREAKS the "AGI Industry"*

https://www.youtube.com/watch?v=QyRWqJehK7I

1littlecoder - How to DOWNLOAD Llama 3.1 LLMs

https://www.youtube.com/watch?v=R_vrjOkGvZ8

Bloomberg - Inside Mark Zuckerberg's AI Era | The Circuit

https://www.youtube.com/watch?v=YuIc4mq7zMU

[–] badcodecat@lemux.minnix.dev 17 points 3 months ago (1 children)

super exciting, but in a way i have kind of "lost interest" in frontier models, since the resources needed to run them are beyond what most people have access to. i mostly see the future in smaller models (like 3.1 8B for example). anyone else share this feeling?

also unrelated but, i was previously librecat on here (my last instance stopped working)

[–] DreamDrifter@lemmynsfw.com 3 points 3 months ago

Agreed - 8b has enough magic to hold a conversation and do small tasks, such as breaking up a large task or picking out key details, which can then be fed into more small models (maybe even more narrowly fine-tuned ones)

180b isn't enough to replace all the other pieces of a system that you need for autonomous action or memory

I think 8b models are enough to make AGI possible if we stack them just right. They're enough to fill in most of the gaps to make practical things too, and they're not that far off for everything else
