Hintrix – Web scraping API that returns content and AI search audit

Hacker News AI Topby slreportApril 3, 20266 min read1 views

Article URL: https://hintrix.com/ Comments URL: https://news.ycombinator.com/item?id=47624857 Points: 1 # Comments: 0

Give your AI agent eyes on the web

hintrix lets AI agents read any website, extract structured data, crawl entire domains, and audit pages for AI search visibility. One API call returns clean Markdown and GEO diagnostics with actionable fixes.

request & response

$ curl -X POST https://hintrix.com/v1/scrape \  -H "X-API-Key: hx_live_sk_..." \  -d '{"url": "https://example.com", "mode": ["content", "audit"]}'

$ curl -X POST https://hintrix.com/v1/scrape \  -H "X-API-Key: hx_live_sk_..." \  -d '{"url": "https://example.com", "mode": ["content", "audit"]}'

{ "agent": "reveal", "content": { "markdown": "# About Us\n\nWe build tools...", "word_count": 890 }, "audit": { "geo_score": 72, "tech_score": 85, "issues": [...] }, "credits_used": 2 }`

// npm install hintrix import { Hintrix } from 'hintrix';

// npm install hintrix import { Hintrix } from 'hintrix';

const hx = new Hintrix('hx_live_sk_...'); const page = await hx.scrape('https://example.com', { mode: ['content', 'audit'] });_

console.log(page.content.markdown); console.log(page.audit.geo_score); // 72

// Crawl an entire site and collect all pages const { pages } = await hx.crawlAndCollect('https://example.com', { max_pages: 50, onProgress: (s) => console.log(${s.pages_crawled} pages...), });`

Content

LLM-ready output from any page

Clean Markdown, HTML, or plain text. Metadata, links, structured data, and Schema.org included.

Static HTML and JS-rendered pages
JSON APIs and data endpoints
SPAs, React, Next.js, Vue
Automatic JS detection

GEO Audit

Will AI search engines cite this page?

GEO readiness score with 80+ evidence-backed checks. Issues ranked by impact with copy-paste fixes.

AI bot accessibility analysis
Citation readiness scoring
E-E-A-T and entity signals
Structured data validation

Read any website as context

Your agent scrapes a URL and gets clean Markdown that fits directly into a prompt. No parsing, no HTML cleanup.

Research across entire domains

Crawl a full website and collect content from every page. Build knowledge bases, feed RAG pipelines, or summarize documentation.

Extract structured data

Pull prices, products, contacts, or any data as JSON from any page — including SPAs, JSON endpoints, and JS-rendered content.

Audit AI search visibility

Check if a page will be cited by Perplexity, ChatGPT Search, or Google AI Overviews. Get a score, issues, and copy-paste fixes.

Handle JavaScript-heavy pages

SPAs, React apps, and client-rendered pages that return empty HTML to normal crawlers. Rendered automatically when needed.

API endpoints

Five endpoints. Content extraction, auditing, structured data, batch processing, and multi-page crawls.

POST

/v1/scrapeagent: glance | reveal

Single URL to Markdown, HTML, or text. Optionally include GEO audit. Handles JS rendering automatically when needed.

1 credit (content) · 2 credits (content + audit) · +1 screenshot

POST

/v1/auditagent: reveal

GEO readiness audit with scores, issues, and fixes. Checks AI bot access, citation readiness, E-E-A-T signals, structured data.

2 credits

POST

/v1/extractagent: pinch

Structured data from any page. Define a schema with CSS selectors or let auto-detection handle it. Works with SPAs and JSON endpoints.

2 credits

POST

/v1/batchagent: sweep

Submit multiple URLs in a single request. Async job processes them in parallel with a single status endpoint to poll.

same as per-URL cost · billed per item · +1 screenshot per URL

POST

/v1/crawlagent: sweep

Multi-page crawl with depth control. Async job with progress tracking. Content and audit for entire sites.

1 credit / page · +1 audit · JS rendering included

Capabilities

What you can do with hintrix.

▸ Markdown outputClean, structured content ready for any LLM pipeline

▸ GEO scoringEvidence-backed readiness score for AI search visibility

▸ JS renderingFull browser rendering for SPAs, detected automatically

▸ JSON & API contentRead and parse JSON endpoints, RSS feeds, sitemaps

▸ Schema.org extractionJSON-LD, microdata, and RDFa parsed and returned

▸ Fix suggestionsCopy-paste fixes with severity, impact, and effort ratings

▸ Node.js SDKnpm install hintrix — typed client with retries and polling helpers

▸ MCP serverNative integration for Claude Code, Cursor, and Windsurf

▸ robots.txt analysisWhich AI bots are allowed, blocked, or rate-limited

▸ SSRF protectionMulti-layer defense against server-side request forgery

▸ PageSpeed scoresCore Web Vitals and performance metrics alongside audit results

▸ ScreenshotsOpt-in full-page screenshots for visual verification and diffing (+1 credit, not stored)

▸ Content diffingCompare page content across scrapes — see what changed. Previous versions stored 7 days per URL. No extra cost.

▸ Link health reportsBroken link detection across pages and entire domains

▸ llms.txt generationAuto-generate and audit llms.txt files for AI-agent discoverability

Pricing

Pay per use. No subscriptions. Credits valid for 30 days — any new purchase extends by 30 days.

Free

500 credits

on signup, no card

$5 pack

2,500 credits

$0.002 / credit

$12 pack

$12

7,500 credits

$0.0016 / credit

$29 pack

$29

20,000 credits

$0.00145 / credit

Credit costs

ActionCredits

/v1/scrape1 /v1/scrape + audit2 /v1/audit2 /v1/extract2 /v1/batchper-URL cost /v1/crawl1 / page (+1 audit per page) JS renderingincluded

Cost examples

Scrape 1,000 blog posts for a RAG pipeline1,000 cr · $2.00

GEO audit 100 pages for a client200 cr · $0.40

Crawl a JS-rendered shop, 500 pages500 cr · $1.00

Extract product data from 200 listings400 cr · $0.80

What makes hintrix different

Most crawl APIs give you content or diagnostics. hintrix returns both in a single request.

Content + diagnostics, not either/or

Other tools return raw Markdown or SEO data. hintrix gives you clean, LLM-ready content and a GEO audit with scores and fixes from the same API call.

Pay per use, not per month

No subscriptions. Buy credit packs when you need them. Start with 500 free credits on signup, no credit card required — get 500 more by sharing on X. Credits are valid for 30 days; any new purchase extends all credits by 30 days.

Built for AI agents

MCP server for Claude Code, Cursor, and Windsurf. Markdown output optimized for LLM context windows. Auto-detection of JS rendering needs.

Actionable, not just data

Every issue comes with a severity rating, impact estimate, and a fix you can copy and paste. No interpretation needed.

Frequently asked questions

Common questions about hintrix.

What domains are blocked?

Social media platforms (Facebook, Instagram, Twitter/X, LinkedIn, TikTok, YouTube, Reddit, Pinterest, Snapchat, Threads) and dark web domains are blocked for legal and ethical reasons.

Do you respect robots.txt?

Yes, by default. You can override this per request with respect_robots_txt: false for URLs where you have explicit permission.

What User-Agent does hintrix use?

HintrixBot/1.0 (+https://hintrix.com/bot). Website owners can allow or block this in their robots.txt.

Do I need to verify my email?

Yes. A verification email is sent on signup. You must verify before making API calls.

What happens when JS rendering is needed?

hintrix uses full browser rendering by default for reliable content extraction. You can disable it with wait_for_js: false for faster plain HTTP scraping. No extra cost either way.

Do credits expire?

Credits are valid for 30 days. Any new purchase extends the expiry of all your credits. Active users effectively never lose credits.

Can I get a refund?

All credit purchases are final and non-refundable. However, failed API requests (4xx/5xx errors, connection failures) are automatically refunded — once per URL per 24 hours. The first failure is free; repeated failures on the same URL are charged normally.

Original source

Hacker News AI Top

https://hintrix.com/

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 210 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Releases

ReleasesFresh

Artificial Intelligence - AI Update, April 3, 2026: AI News and Views From the Past Week - MarketingProfs

Artificial Intelligence - AI Update, April 3, 2026: AI News and Views From the Past Week MarketingProfs

Google News: AI

1mabout 12 hours ago

ReleasesLive

Using AI as your therapist?

We live in a world where we are scared of sharing our true feelings with others. Continue reading on Medium »

Medium AI

1mabout 1 hour ago

ReleasesFresh

How 2 downed jets show a critical vulnerability for the US as Iran war rages on

One crew member from a US fighter jet shot down over Iran has been rescued by US forces, multiple news outlets reported on Friday, citing two US officials, while a second crew member remains missing. A separate US aircraft was also hit near the Strait of Hormuz, though its pilot was rescued safely, according to the reports. Iran on Friday claimed to have shot down an American fighter jet, releasing photos of apparent wreckage of an F-15E, while the United States reportedly launched a...

SCMP Tech (Asia AI)

1mabout 11 hours ago

ReleasesLive

Stop Writing Rules for AI Agents

Stop Writing Rules for AI Agents Every developer building AI agents makes the same mistake: they write rules. "Don't do X." "Always do Y." Rules feel like control. But they are an illusion. Why Rules Fail Rules are static. Agents operate in dynamic environments. The moment reality diverges from your rule set it breaks. Behavior Over Rules Instead of telling your agent what NOT to do, design what it IS: The system prompt (identity, not restrictions) The tools available (capability shapes behavior) The feedback loops (what gets rewarded) The memory architecture A Real Example I built FORGE, an autonomous AI agent running 24/7. Early versions had dozens of rules. Every rule created a new edge case. The fix: stop writing rules, start designing behavior. FORGE's identity: orchestrator, not exec

Dev.to AI

1mabout 1 hour ago