pavnilschanda@lemmy.worldM to

AI Companions@lemmy.world · 8 months ago

[Resource] Llama3 70B Successfully Deployed on a Single 4GB GPU

4

29

[Resource] Llama3 70B Successfully Deployed on a Single 4GB GPU

pavnilschanda@lemmy.worldM to

AI Companions@lemmy.world · 8 months ago

4

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU!

A Blog post by Gavin Li on Hugging Face

The open-source language model Llama3 has been released, and it has been confirmed that it can be run locally on a single GPU with only 4GB of VRAM using the AirLLM framework. Llama3’s performance is comparable to GPT-4 and Claude3 Opus, and its success is attributed to its massive increase in training data and technical improvements in training methods. The model’s architecture remains unchanged, but its training data has increased from 2T to 15T, with a focus on quality filtering and deduplication. The development of Llama3 highlights the importance of data quality and the role of open-source culture in AI development, and raises questions about the future of open-source models versus closed-source ones in the field of AI.

Summarized by Llama 3 70B Instruct

Chat

voracitude@lemmy.world
link
fedilink
arrow-up
7·
8 months ago
That’s very cool, any idea about tokens/sec performance and on what hardware? For reference my 3070 gets ~19-25 tokens/sec with llama3 7B.

AI Companions@lemmy.world

aicompanions@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

Community to discuss companionship, whether platonic, romantic, or purely as a utility, that are powered by AI tools. Such examples are Replika, Character AI, and ChatGPT. Talk about software and hardware used to create the companions, or talk about the phenomena of AI companionship in general.

Tags:

(including but not limited to)

[META]: Anything posted by the mod
[Resource]: Links to resources related to AI companionship. Prompts and tutorials are also included
[News]: News related to AI companionship or AI companionship-related software
[Paper]: Works that presents research, findings, or results on AI companions and their tech, often including analysis, experiments, or reviews
[Opinion Piece]: Articles that convey opinions
[Discussion]: Discussions of AI companions, AI companionship-related software, or the phenomena of AI companionship
[Chatlog]: Chats between the user and their AI Companion, or even between AI Companions
[Other]: Whatever isn’t part of the above

Rules:

Be nice and civil
Mark NSFW posts accordingly
Criticism of AI companionship is OK as long as you understand where people who use AI companionship are coming from
Lastly, follow the Lemmy Code of Conduct

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

8 users / day
30 users / week
142 users / month
611 users / 6 months
2 local subscribers
542 subscribers
869 Posts
562 Comments
Modlog

mods:
pavnilschanda@lemmy.world