Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessAI Tools That Actually Pay You Back: A Developer's Guide to Monetizing AIDev.to AIThe $6 Million Shockwave: How DeepSeek Just Broke the AI MonopolyMedium AIHow I Got My First Freelance Client in 3 Days (Using AI) — Beginner Guide (India 2026)Medium AIWhy Your Resume Gets Rejected Before a Human Sees It (And How to Fix It)Dev.to AII've Been Saying RAG Is Dead Since 2020Medium AIAI Print-on-Demand Passive Income: ₹400-2K/Design from HomeDev.to AIFaceless YouTube Automation with AI: Complete Guide 2026Dev.to AIAutomate Churn Analysis and Win-Backs with AI-Powered PersonalizationDev.to AINo-Code vs. Custom Code: Which One Should You Actually Use for Your MVP?Dev.to AIBizNode's semantic memory (Qdrant) makes your bot smarter over time — it remembers past conversations and answers intelligently using RAGDev.to AIAI Side Hustles for Indians 2026: 10 Ways to Earn ₹50K+/MonthDev.to AIRegional Content Creation 2026: Earn ₹20K-3L in Hindi/Tamil/TeluguDev.to AIBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessAI Tools That Actually Pay You Back: A Developer's Guide to Monetizing AIDev.to AIThe $6 Million Shockwave: How DeepSeek Just Broke the AI MonopolyMedium AIHow I Got My First Freelance Client in 3 Days (Using AI) — Beginner Guide (India 2026)Medium AIWhy Your Resume Gets Rejected Before a Human Sees It (And How to Fix It)Dev.to AII've Been Saying RAG Is Dead Since 2020Medium AIAI Print-on-Demand Passive Income: ₹400-2K/Design from HomeDev.to AIFaceless YouTube Automation with AI: Complete Guide 2026Dev.to AIAutomate Churn Analysis and Win-Backs with AI-Powered PersonalizationDev.to AINo-Code vs. Custom Code: Which One Should You Actually Use for Your MVP?Dev.to AIBizNode's semantic memory (Qdrant) makes your bot smarter over time — it remembers past conversations and answers intelligently using RAGDev.to AIAI Side Hustles for Indians 2026: 10 Ways to Earn ₹50K+/MonthDev.to AIRegional Content Creation 2026: Earn ₹20K-3L in Hindi/Tamil/TeluguDev.to AI
AI NEWS HUBbyEIGENVECTOREigenvector

Read More, Think More: Revisiting Observation Reduction for Web Agents

arXiv cs.CLby Masafumi Enomoto, Ryoma Obara, Haochen Zhang, Masafumi OyamadaApril 4, 20261 min read0 views
Source Quiz

arXiv:2604.01535v1 Announce Type: new Abstract: Web agents based on large language models (LLMs) rely on observations of web pages -- commonly represented as HTML -- as the basis for identifying available actions and planning subsequent steps. Prior work has treated the verbosity of HTML as an obstacle to performance and adopted observation reduction as a standard practice. We revisit this trend and demonstrate that the optimal observation representation depends on model capability and thinking token budget: (1) compact observations (accessibility trees) are preferable for lower-capability models, while detailed observations (HTML) are advantageous for higher-capability models; moreover, increasing thinking tokens further amplifies the benefit of HTML. (2) Our error analysis suggests that

View PDF HTML (experimental)

Abstract:Web agents based on large language models (LLMs) rely on observations of web pages -- commonly represented as HTML -- as the basis for identifying available actions and planning subsequent steps. Prior work has treated the verbosity of HTML as an obstacle to performance and adopted observation reduction as a standard practice. We revisit this trend and demonstrate that the optimal observation representation depends on model capability and thinking token budget: (1) compact observations (accessibility trees) are preferable for lower-capability models, while detailed observations (HTML) are advantageous for higher-capability models; moreover, increasing thinking tokens further amplifies the benefit of HTML. (2) Our error analysis suggests that higher-capability models exploit layout information in HTML for better action grounding, while lower-capability models suffer from increased hallucination under longer inputs. We also find that incorporating observation history improves performance across most models and settings, and a diff-based representation offers a token-efficient alternative. Based on these findings, we suggest practical guidelines: adaptively select observation representations based on model capability and thinking token budget, and incorporate observation history using diff-based representations.

Subjects:

Computation and Language (cs.CL)

Cite as: arXiv:2604.01535 [cs.CL]

(or arXiv:2604.01535v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2604.01535

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Masafumi Enomoto [view email] [v1] Thu, 2 Apr 2026 02:14:47 UTC (325 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modellanguage modelannounce

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Read More, …modellanguage mo…announceavailableanalysistrendarXiv cs.CL

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 183 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models