Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessVCs are covering expenses like rent for young college dropouts founding AI startups; Antler: average AI unicorn founder age fell from 40 in 2020 to 29 in 2024 (Kate Clark/Wall Street Journal)TechmemeRunning OpenClaw with Gemma 4 TurboQuant on MacAir 16GBReddit r/LocalLLaMAStop Explaining Your Codebase to Your AI Every TimeDEV Community📙 Journal Log no. 1 Linux Unhatched ; My DevSecOps JourneyDEV CommunityPerplexity AI Under Fire In Lawsuit Alleging Privacy Violations - AOL.comGNews AI privacySTEEP: Your repo's fortune, steeped in truth.DEV CommunityVCSU Hosting Free Public Lecture on (AI) Artificial Intelligence - newsdakota.comGoogle News: AI[D] KDD Review DiscussionReddit r/MachineLearningI Built an MCP Server That Understands Your MSBuild Project Graph — Before You BuildDEV CommunityGemma 4 31B beats several frontier models on the FoodTruck BenchReddit r/LocalLLaMA1 Artificial Intelligence (AI) Stock That Could Be Worth a Fortune by 2030 - finance.yahoo.comGoogle News: AI1 Artificial Intelligence (AI) Stock That Could Be Worth a Fortune by 2030 - fool.comGoogle News: AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessVCs are covering expenses like rent for young college dropouts founding AI startups; Antler: average AI unicorn founder age fell from 40 in 2020 to 29 in 2024 (Kate Clark/Wall Street Journal)TechmemeRunning OpenClaw with Gemma 4 TurboQuant on MacAir 16GBReddit r/LocalLLaMAStop Explaining Your Codebase to Your AI Every TimeDEV Community📙 Journal Log no. 1 Linux Unhatched ; My DevSecOps JourneyDEV CommunityPerplexity AI Under Fire In Lawsuit Alleging Privacy Violations - AOL.comGNews AI privacySTEEP: Your repo's fortune, steeped in truth.DEV CommunityVCSU Hosting Free Public Lecture on (AI) Artificial Intelligence - newsdakota.comGoogle News: AI[D] KDD Review DiscussionReddit r/MachineLearningI Built an MCP Server That Understands Your MSBuild Project Graph — Before You BuildDEV CommunityGemma 4 31B beats several frontier models on the FoodTruck BenchReddit r/LocalLLaMA1 Artificial Intelligence (AI) Stock That Could Be Worth a Fortune by 2030 - finance.yahoo.comGoogle News: AI1 Artificial Intelligence (AI) Stock That Could Be Worth a Fortune by 2030 - fool.comGoogle News: AI
AI NEWS HUBbyEIGENVECTOREigenvector

Vulkan backend much easier on the CPU and GPU memory than CUDA.

Reddit r/LocalLLaMAby /u/Im_Still_Here12 https://www.reddit.com/user/Im_Still_Here12April 2, 20261 min read2 views
Source Quiz

On linux and compiled my own llama.cpp with CUDA support, top would always show one pegged CPU core at 100% when running Qwen3.5-9B-GGUF:Q4_K_M on my potato like RTX A2000 12GB. Also, nvidia-smi would show 11GB+ of GPU memory usage. Speed is ~30 tokens per second. My system fans would spin up when this single core gets pegged which was annoying to listen to. Decided to compile llama.cpp again with Vulkan backend to see if anything would be different. Well it was a big difference when using the exact same model Now, top is only showing one CPU core at about 30% usage and nvidia-smi is only showing 7.2GB of GPU memory usage. Speed is the same at ~30 tokens per second. No longer have my system fan spinning up when running inferencing. Just curious why the GPU memory footprint is lower and CPU

Could not retrieve the full article text.

Read on Reddit r/LocalLLaMA →
Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

llamamodelllama.cpp

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Vulkan back…llamamodelllama.cppReddit r/Lo…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 165 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!