Live
Black Hat USADark ReadingBlack Hat AsiaAI Businessv0.20.3Ollama Releasestrunk/06cee8b2f9c6b2c10076efb3082adb7c2605a98c: [vllm hash update] update the pinned vllm hash (#179531)PyTorch ReleasesAI startup Rocket offers vibe McKinsey-style reports at a fraction of the costTechCrunch AIChatGPT Now Crawls 3.6x More Than Googlebot: What 24M Requests Reveal - Search Engine JournalGoogle News: ChatGPTYour Claude Code is Starving, the Food’s Scattered All Over Your Org, and Some of it is StaleTowards AItrunk/5e79c7376a212f6abc628dc596ddec1fcf67e1cb: Update third_party/kineto submodule to 4826a43 (#179492)PyTorch ReleasesMistral Introduces "Voxtral TTS": An Open-Weight Text-to-Voice Model Capable Of Cloning Any Voice From 3 Seconds Of Audio, Runs In 9 Languages, & Beats Elevenlabs Flash V2.5 With A 68.4% Human Preference Win Rate.Reddit r/LocalLLaMAAI chatbots programmed to validate users relying on mental health advice, experts warn - FOX 10 PhoenixGNews AI mental healthThe Agentic AI: How Autonomous AI Systems Are Rewriting the Rules of Work, Business, and TechnologyTowards AI[R] Agentic AI and Occupational Displacement: A Multi-Regional Task Exposure Analysis (236 occupations, 5 US metros)Reddit r/MachineLearningBefore Word2Vec: The Strange, Fascinating Road from Counting Words to Learning MeaningTowards AIAI Agents Are Calling Restaurants. Restaurants Can’t Talk Back.Towards AIBlack Hat USADark ReadingBlack Hat AsiaAI Businessv0.20.3Ollama Releasestrunk/06cee8b2f9c6b2c10076efb3082adb7c2605a98c: [vllm hash update] update the pinned vllm hash (#179531)PyTorch ReleasesAI startup Rocket offers vibe McKinsey-style reports at a fraction of the costTechCrunch AIChatGPT Now Crawls 3.6x More Than Googlebot: What 24M Requests Reveal - Search Engine JournalGoogle News: ChatGPTYour Claude Code is Starving, the Food’s Scattered All Over Your Org, and Some of it is StaleTowards AItrunk/5e79c7376a212f6abc628dc596ddec1fcf67e1cb: Update third_party/kineto submodule to 4826a43 (#179492)PyTorch ReleasesMistral Introduces "Voxtral TTS": An Open-Weight Text-to-Voice Model Capable Of Cloning Any Voice From 3 Seconds Of Audio, Runs In 9 Languages, & Beats Elevenlabs Flash V2.5 With A 68.4% Human Preference Win Rate.Reddit r/LocalLLaMAAI chatbots programmed to validate users relying on mental health advice, experts warn - FOX 10 PhoenixGNews AI mental healthThe Agentic AI: How Autonomous AI Systems Are Rewriting the Rules of Work, Business, and TechnologyTowards AI[R] Agentic AI and Occupational Displacement: A Multi-Regional Task Exposure Analysis (236 occupations, 5 US metros)Reddit r/MachineLearningBefore Word2Vec: The Strange, Fascinating Road from Counting Words to Learning MeaningTowards AIAI Agents Are Calling Restaurants. Restaurants Can’t Talk Back.Towards AI
AI NEWS HUBbyEIGENVECTOREigenvector

Gemma-4 E4B model's vision seems to be surprisingly poor

Reddit r/LocalLLaMAby /u/specji https://www.reddit.com/user/specjiApril 6, 20262 min read0 views
Source Quiz

The E4B model is performing very poorly in my tests and since no one seems to be talking about it that I had to unlurk myself and post this. Its performing badly even compared to qwen3.5-4b. Can someone confirm or dis...uh...firm (?) My test suite has roughly 100 vision related tasks: single-turn with no tools, only an input image and prompt, but with definitive answers (not all of them are VQA though). Most of these tasks are upstream from any kind of agentic use case. To give a sense: there are tests where the inputs are screenshots from which certain text information has to be extracted, others are images on which the model has to perform some inference (for example: geoguessing on travel images, calculating total cost of a grocery list given an image of the relevant supermarket display

Could not retrieve the full article text.

Read on Reddit r/LocalLLaMA →
Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

llamamodeltransformer

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Gemma-4 E4B…llamamodeltransformervaluationmarketagenticReddit r/Lo…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 278 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models