Submission: OpenVINO 2025.2.0 Now Supports GGUF — Learn How to Use It (medium.com)
cabelo writes: I consider GGUF one of the most efficient formats for model inference: it is a highly optimized binary format designed for fast loading and saving. It is efficient enough that Meta itself recommends it on the official LLaMA project page. GGUF is now also supported by OpenVINO, a project I maintain on the openSUSE platform. With OpenVINO, inference can run on ARM and Intel processors without depending on a GPU. Below, I present a tutorial with the steps to use it.
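As a quick taste before the linked tutorial, here is a minimal sketch of loading a GGUF model with the OpenVINO GenAI Python API. It assumes the 2025.2 LLMPipeline can consume a .gguf file directly on CPU; the model file name and prompt are placeholder values, not taken from the article.

```python
# Minimal sketch: run a GGUF model on CPU with OpenVINO GenAI.
# Assumes openvino-genai >= 2025.2, the release that added GGUF support.
# The model file name below is a placeholder, not from the article.
import openvino_genai as ov_genai

# LLMPipeline loads the model and builds a text-generation pipeline
# on the chosen device ("CPU" covers both Intel and ARM hosts).
pipe = ov_genai.LLMPipeline("qwen2.5-1.5b-instruct-q4_k_m.gguf", "CPU")

# Generate a short completion from a prompt.
print(pipe.generate("What is OpenVINO?", max_new_tokens=100))
```

The full walkthrough, including model conversion options and benchmarking, is in the linked Medium post.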