Gemma 4 - 4B vs Qwen 3.5 - 9B?

Reddit r/LocalLLaMA, by /u/No-Mud-1902 (https://www.reddit.com/user/No-Mud-1902), April 4, 2026

Hello! Has anyone tried both the 4B Gemma 4 model and the Qwen 3.5 9B model, and can you share your feedback? On the benchmarks Qwen seems to do better, but I would appreciate any personal experience on the matter. Thanks!

Original source

Reddit r/LocalLLaMA

https://www.reddit.com/r/LocalLLaMA/comments/1sc40bk/gemma_4_4b_vs_qwen_35_9b/
