GGML and llama.cpp join HF to ensure the long-term progress of Local AI
Hugging FaceArchived Mar 17, 2026✓ Full text saved
Full text archived locally
✦ AI Summary· Claude Sonnet
Back to Articles
GGML and llama.cpp join HF to ensure the long-term progress of Local AI
Published February 20, 2026
Update on GitHub
Georgi Gerganov
ggerganov
Follow
Xuan-Son Nguyen
ngxson
Follow
Aleksander Grygier
allozaur
Follow
Lysandre
lysandre
Follow
Victor Mustar
victor
Follow
Julien Chaumond
julien-c
Follow
We are super happy to announce that GGML, creators of Llama.cpp, are joining HF in order to keep future AI open. 🔥
Georgi Gerganov and team are joining HF with the goal of scaling and supporting the community behind ggml and llama.cpp as Local AI continues to make exponential progress in the coming years.
We've been working with Georgi and team for quite some time (we even have awesome core contributors to llama.cpp like Son and Alek in the team already) so this has been a very natural process.
llama.cpp is the fundamental building block for local inference, and transformers is the fundamental building block for model definition, so this is basically a match made in heaven. ❤️
What will change for llama.cpp, the open source project and the community?
Not much – Georgi and team still dedicate 100% of their time maintaining llama.cpp and have full autonomy and leadership on the technical directions and the community. HF is providing the project with long-term sustainable resources, improving the chances of the project to grow and thrive. The project will continue to be 100% open-source and community driven as it is now.
Technical focus
llama.cpp is the fundamental building block for local inference, and transformers is the fundamental building block for definition of models and architectures, so we’ll work on making sure it’s as seamless as possible in the future (almost “single-click”) to ship new models in llama.cpp from the transformers library ‘source of truth’ for model definitions.
Additionally, we will improve packaging and user experience of ggml-based software. As we enter the phase in which local inference becomes a meaningful and competitive alternative to cloud inference, it is crucial to improve and simplify the way in which casual users deploy and access local models. We will work towards making llama.cpp ubiquitous and readily available everywhere.
Our long term vision
Our shared goal is to provide the community with the building blocks to make open-source superintelligence accessible to the world over the coming years.
We will achieve this together with the growing Local AI community, as we continue to build the ultimate inference stack that runs as efficiently as possible on our devices.
More Articles from our Blog
Llm
Fine-Tuning
Open-Source
Codex is Open Sourcing AI models
78
December 10, 2025
Llm
Fine-Tuning
Open-Source
Hot
We Got Claude to Fine-Tune an Open Source LLM
609
December 3, 2025
Community
Bright8192
24 days ago
Big congrats to GGML and Hugging Face! Great news for the Local AI community. Excited to see llama.cpp grow stronger and make local AI easier for everyone!
See translation
4 replies
·
🔥
19
+
Adamqubit
23 days ago
This comment has been hidden (marked as Off-Topic)
Expand 3 replies
Room64
24 days ago
LLama.cpp is the best AI project by far, super reactive to bug solve, very competent team, love you guys, you desserve it
See translation
❤️
23
🔥
3
👍
3
🚀
1
👀
1
🤗
1
😎
1
➕
1
🧠
1
🤝
1
😔
1
🤯
1
+
Reply
Xenova
24 days ago
Our shared goal is to provide the community with the building blocks to make open-source superintelligence accessible to the world over the coming years.
See translation
🔥
46
+
Reply
Trilogix1
24 days ago
Hugging Face smart moves never ending.
Are you guys using AI for advice? I wonder which of 2 million AI models you are using 😄
See translation
👀
4
+
Reply
joshnur
24 days ago
Great news.
Serving with llama.cpp using HF-hosted models, including unsloth's on AMD Strix Halo and OpenCode here.
See translation
❤️
3
+
Reply
raphaelamorim
24 days ago
•
edited 24 days ago
Congrats to both teams. Well deserved. Wonderful news for wonderful teams and community.
See translation
🤗
4
+
Reply
iyanello
24 days ago
Congratulations to Georgi Gerganov and team! So happy for you guys, this is huge success!
See translation
❤️
2
+
Reply
Tugay31
24 days ago
Great news. congrats to GGML and HF. . always LocalAI.
See translation
❤️
2
+
Reply
arkavo-paul
24 days ago
This is a match made in heaven for the local AI ecosystem. Transformers as the model definition layer plus llama.cpp as the local inference layer, backed by HF's long-term resources, gives the entire community a stable foundation to build on for years to come.
The focus on packaging and user experience is especially important. Making local inference accessible beyond developers is how we get to an AI future that's open, private, and user-owned — not locked behind API calls.
Congratulations to Georgi and team. Open-source superintelligence that runs on your own hardware isn't just a technical goal, it's a trust model.
See translation
🔥
5
+
Reply
simeks18
24 days ago
Congratulations! I love Llama.cpp and I love running my models locally. This is absolutely the future of transparency and I love the push for the open, private, user-owned software world! Thank you for all that you are doing!
See translation
🔥
4
+
Reply
tuaris
24 days ago
•
edited 24 days ago
So basically HF, "acquires" an open source project. hmm. I've seen this before and it never ends well (see Trixbox, PCBSD, FreeNAS, etc..).
I sure hope history doesn't repeat itself (yet it always does).
See translation
👀
4
+
Reply
jimenezcarrero
24 days ago
It’s great news for the future of edge AI!
See translation
🔥
1
+
Reply
woctordho
24 days ago
Please also acquire ik_llama
See translation
👍
3
+
Reply
Tonic
24 days ago
gglm's gguf format now the prefered default for executorch (on device) inference 🚀🦙
See translation
🤗
2
+
Reply
CyberMas
23 days ago
This comment has been hidden (marked as Off-Topic)
pulak007
23 days ago
niceee.
Reply
sverinn
23 days ago
finally, something good about living in modern world, you guys are awesome!
See translation
Reply
SqueezingFace
23 days ago
"...it is crucial to improve and simplify the way in which casual users deploy and access local models. We will work towards making llama.cpp ubiquitous and readily available everywhere." (It... already was?)
Before you upvote. Raise your hand if you realize that hf.co is a business with the necessary end goal of making money. This isn't a bad thing; however, this blog post is so devoid of substance and so full of hypebole that one can't help but wonder.
See translation
1 reply
·
pszemraj
21 days ago
•
edited 21 days ago
I cant wait till hf adds a quota/limitation on amount of models you can quantize with future versions of llama.cpp requiring hf login /token to quantize a model (trust me bro its just basic telemetry) 😻
of course, when the noose tightens further it won't be officially discussed/acknowledged (why would we?? we have so much to share with sell to the #community like this robot! look its so cute)
See translation
clover-supply
23 days ago
Maybe lcpp will now natively support image models quanting? yay
See translation
2 replies
·
Henk717
22 days ago
Check out stable-diffusion.cpp for this, or KoboldCpp if you want a fork that has both llamacpp and stablediffusioncpp integrated.
See translation
Expand 1 reply
salihfurkaan
23 days ago
This is awesome news! Making llama.cpp and the GGML ecosystem more sustainable and widely supported will help local AI become more accessible and easier to use for everyone for sure.
See translation
🔥
2
+
Reply
rombodawg
20 days ago
Does this mean that we will have GGUF quants of models as they release, or at least support for gguf out of the box for new models in the future?
See translation
Reply
AkujinLiffy
20 days ago
reasoning:
min_steps: 2 # Minimum reasoning steps before code
require_action_field: true # Each step must have thought + action = freedom
confidence_calibration: true # Post-process confidence scores ciritcally high!
Congrads!
See translation
Reply
datayoda
19 days ago
Get Pi as the agent harness next
See translation
1 reply
·
👍
1
+
julien-c
Article author
18 days ago
cc: @victor
👍
2
👀
2
+
Serveurperso
18 days ago
A great milestone for Local AI!
For those already living daily in the ggml and llama.cpp ecosystem, this is a strong signal for what’s ahead.
The alignment with Transformers brings clear strategic coherence.
A solid move. Looking forward to what comes next.
See translation
1 reply
·
❤️
1
+
julien-c
Article author
18 days ago
🔥
🚀
1
+
scthornton
18 days ago
Congratulations to all involved! These are great additions!
See translation
Reply
deleted
15 days ago
This comment has been hidden
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.
Comment
· Sign up or log in to comment
Upvote
486
+474