Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessDySCo: Dynamic Semantic Compression for Effective Long-term Time Series ForecastingarXivUQ-SHRED: uncertainty quantification of shallow recurrent decoder networks for sparse sensing via engressionarXivAn Online Machine Learning Multi-resolution Optimization Framework for Energy System Design Limit of Performance AnalysisarXivMalliavin Calculus for Counterfactual Gradient Estimation in Adaptive Inverse Reinforcement LearningarXivEfficient and Principled Scientific Discovery through Bayesian Optimization: A TutorialarXivMassively Parallel Exact Inference for Hawkes ProcessesarXivModel Merging via Data-Free Covariance EstimationarXivDetecting Complex Money Laundering Patterns with Incremental and Distributed Graph ModelingarXivForecasting Supply Chain Disruptions with Foresight LearningarXivSven: Singular Value Descent as a Computationally Efficient Natural Gradient MethodarXivSECURE: Stable Early Collision Understanding via Robust Embeddings in Autonomous DrivingarXivJetPrism: diagnosing convergence for generative simulation and inverse problems in nuclear physicsarXivBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessDySCo: Dynamic Semantic Compression for Effective Long-term Time Series ForecastingarXivUQ-SHRED: uncertainty quantification of shallow recurrent decoder networks for sparse sensing via engressionarXivAn Online Machine Learning Multi-resolution Optimization Framework for Energy System Design Limit of Performance AnalysisarXivMalliavin Calculus for Counterfactual Gradient Estimation in Adaptive Inverse Reinforcement LearningarXivEfficient and Principled Scientific Discovery through Bayesian Optimization: A TutorialarXivMassively Parallel Exact Inference for Hawkes ProcessesarXivModel Merging via Data-Free Covariance EstimationarXivDetecting Complex Money Laundering Patterns with Incremental and Distributed Graph ModelingarXivForecasting Supply Chain Disruptions with Foresight LearningarXivSven: Singular Value Descent as a Computationally Efficient Natural Gradient MethodarXivSECURE: Stable Early Collision Understanding via Robust Embeddings in Autonomous DrivingarXivJetPrism: diagnosing convergence for generative simulation and inverse problems in nuclear physicsarXiv
AI NEWS HUBbyEIGENVECTOREigenvector

b8640

llama.cpp Releasesby github-actions[bot]April 2, 20262 min read0 views
Source Quiz

tests : add unit test coverage for llama_tensor_get_type ( #20112 ) Add unit test coverage for llama_tensor_get_type Fix merge conflicts, add more schemas clang formatter changes Trailing whitespace Update name Start rebase Updating files with upstream changes prior to rebase Changes needed from rebase Update attn_qkv schema, change throw behaviour Fix merge conflicts White space Update with latest changes to state counters Revert accidental personal CLAUDE.md changes Change quotation mark Reuse metadata.name since we have it Move test-only stuff out of llama-quant.cpp Hide the regex functionality back in llama-quant.cpp, use a unique pointer to a new struct 'compiled_tensor_type_patterns' which contains the patterns cont : inital deslop guidelines Cleanup based on review comments Continue

tests : add unit test coverage for llama_tensor_get_type (#20112)

  • Add unit test coverage for llama_tensor_get_type

  • Fix merge conflicts, add more schemas

  • clang formatter changes

  • Trailing whitespace

  • Update name

  • Start rebase

  • Updating files with upstream changes prior to rebase

  • Changes needed from rebase

  • Update attn_qkv schema, change throw behaviour

  • Fix merge conflicts

  • White space

  • Update with latest changes to state counters

  • Revert accidental personal CLAUDE.md changes

  • Change quotation mark

  • Reuse metadata.name since we have it

  • Move test-only stuff out of llama-quant.cpp

  • Hide the regex functionality back in llama-quant.cpp, use a unique pointer to a new struct 'compiled_tensor_type_patterns' which contains the patterns

  • cont : inital deslop guidelines

  • Cleanup based on review comments

  • Continue cleanup

  • Small cleanup

  • Manually set proper ordering of tensors, mostly applies to gemma

  • Formatting

  • Update tests/test-quant-type-selection.cpp

Co-authored-by: Sigbjørn Skjæret [email protected]

  • Fix merge conflicts

Co-authored-by: Georgi Gerganov [email protected] Co-authored-by: Sigbjørn Skjæret [email protected]

macOS/iOS:

  • macOS Apple Silicon (arm64)

  • macOS Intel (x64)

  • iOS XCFramework

Linux:

  • Ubuntu x64 (CPU)

  • Ubuntu arm64 (CPU)

  • Ubuntu s390x (CPU)

  • Ubuntu x64 (Vulkan)

  • Ubuntu arm64 (Vulkan)

  • Ubuntu x64 (ROCm 7.2)

  • Ubuntu x64 (OpenVINO)

Windows:

  • Windows x64 (CPU)

  • Windows arm64 (CPU)

  • Windows x64 (CUDA 12) - CUDA 12.4 DLLs

  • Windows x64 (CUDA 13) - CUDA 13.1 DLLs

  • Windows x64 (Vulkan)

  • Windows x64 (SYCL)

  • Windows x64 (HIP)

openEuler:

  • openEuler x86 (310p)

  • openEuler x86 (910b, ACL Graph)

  • openEuler aarch64 (310p)

  • openEuler aarch64 (910b, ACL Graph)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

claudellamaupdate

Knowledge Map

Knowledge Map
TopicsEntitiesSource
b8640claudellamaupdatereviewllama.cpp R…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 326 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models