Getting Started With QueryBurst: How to Audit Your Website's AI Visibility

AI visibility depends on how well a site's content, structure, and metadata align with the retrieval pipelines used by AI search systems (Google AI Overview, ChatGPT, Perplexity). Measuring it requires crawling the site, extracting entities and relationships, building a knowledge graph, and testing content against simulated AI queries.

QueryBurst automates this end-to-end — this guide covers setup, running a first crawl, and choosing which reports to explore.

Step 1: Subscribe to Pro

QueryBurst's crawling, intelligence pipeline, and analysis tools require a Pro subscription. From the home screen, click Upgrade to Pro and complete the checkout. You can cancel anytime.

click the "Upgrade To Pro" button to upgrade your QueryBurst account.

Step 2: Add Your Site and Run a Crawl

  1. Click Sites & Crawls in the sidebar (or the home screen button)
  2. Switch to the + New Crawl tab
  3. Enter your domain URL or select a GSC property (if Search Console is connected)
  4. Configure crawl scope — discover subdirectories, exclude paths, set max pages
  5. Click Start Indexing
start a new crawl in QueryBurst

QueryBurst will crawl the site, fetch every reachable page, convert it to clean markdown, split content into chunks, and build the internal link graph. Paginated URLs and non-content file types are excluded automatically.

Crawl configuration

Before starting, you can fine-tune what gets crawled:

  • Subdirectory discovery — Maps your site's structure from the sitemap so you can include or exclude entire sections (e.g. skip /blog/archive or /fr/)
  • Exclude paths — Manually exclude specific paths by prefix, exact match, or substring
  • File type exclusions — PDFs, images, and static assets are excluded by default
  • Max pages — Slider from 50 to 5,000 (or your remaining monthly quota)

For full details on all configuration options, see the Sites & Crawls doc.

Crawl limits

  • Up to 5,000 pages per site
  • 10,000 pages per month included with Pro

The crawl typically completes within a few minutes for small sites. Larger crawls (1,000+ pages) can take 1–4 hours for the full pipeline, but core reports become available roughly halfway through.

Step 3: Connect Search Console (Optional)

Connecting Google Search Console unlocks keyword-level data across several reports.

  1. Go to Account → Integrations (or click Connect on the home screen)
  2. Authenticate with your Google account
  3. Select the Search Console property that matches your site

What it enables

ReportWithout GSCWith GSC
Search Console ExplorerNot availableFull conversational GSC analysis with AI agent
Topic CoverageCustom prompts onlyGSC keyword alignment + expected topics
Page InsightsBasic page metricsPerformance data + keyword breakdown
GSC PerformanceNot availableNatural language GSC queries

GSC is not required for the core analysis pipeline, intelligence extraction, or any of the AI tools.

Step 4: Site Intelligence Builds Automatically

After the crawl completes, the Site Intelligence pipeline runs automatically. This is the core analysis that powers the Knowledge Graph, Flow, and many other reports.

What the pipeline extracts

  1. Primary entities — A lightweight pass identifies the main entity on each page
  2. Full knowledge extraction — A deeper pass extracts all entities, relationships (subject–predicate–object triples), summaries, page types, and target queries
  3. Entity deduplication — Embedding similarity + review merges duplicate entity names
  4. Topic clusters — Semantic clustering of pages by topic, with redundancy, dilution, and missed-link detection

A spinner appears next to Site Intelligence in the sidebar while the pipeline is running. Some reports (Topics, Issues) remain disabled until intelligence is available.

Re-running intelligence

Intelligence is content-hash-cached. When you re-crawl a site, only pages with changed content are re-processed. Unchanged pages are skipped automatically.

Step 5: Connect Cloudflare for AI Crawler Logs (Optional)

If your site uses Cloudflare, you can connect your account to track AI bot activity (GPTBot, ClaudeBot, etc.) across your pages.

  1. Enter your Cloudflare API token
  2. Select the zone that matches your domain
  3. AI crawler data appears on the dashboard and per-page overview

This is independent of the crawl and site intelligence pipeline — it monitors real bot traffic to your site.

What to Explore First

Once your crawl is complete and site intelligence is ready, here's a recommended order:

1. Page Reports

Browse your crawled pages in Page Reports. Check structure scores, SEO scores, and use semantic search to find pages by topic. Click any page to access the full analysis suite — Overview, AI Simulation, Entities, Links, Structure, Technical, and Content Preview.

2. Site Intelligence (Knowledge Graph)

Open Site ntelligence in the sidebar. This is the main view for understanding what AI extracts from your site — entity profiles, statements, relationships, and how consistently your content defines your key entities. Explore the Entity Universe scatter plot, review dedup groups, and click into entity profiles.

Check your internal linking health. The Overview shows structural vs content depth, anchor text distribution, hub pages, dead ends, and orphans. Use the Link Explorer for interactive graph traversal.

4. Topics

Once site intelligence is ready, Topics shows how entity prominence distributes across your site — which entities are concentrated, which are diluted, and where peer links are missing. Issues surfaces heuristic problems like duplicate titles, keyword cannibalisation, and orphan topics.

5. Answer Spy

Run your first AI criteria analysis — enter your service, product, or content type and see what criteria AI models use to evaluate recommendations in your niche. Then investigate how well your site covers those criteria.

Every report is accessible from the sidebar. Here's what each section does:

Sidebar ItemWhat It Does
Site IntelligenceKnowledge Graph — entities, relationships, topics, claims, questions, comparison pages, architecture
Link AnalysisInternal link health — depth, anchors, orphans, link graph explorer
TopicsEntity prominence, topic map, entity hubs (requires intelligence)
IssuesHeuristic content and architecture issues (requires intelligence)
Page ReportsContent Health index of all crawled pages with per-page analysis
Answer SpyAI criteria extraction and site investigation
OptimizerCitation/retrieval simulation (ChatGPT-style pipeline)
Content LabDeep content editor with chunk analysis and competitor compare
QFO SimulatorQuery Fan-Out — how AI decomposes complex queries
RAG ChatAsk questions about your site content with citations
ClaimsVerify marketing claims and check factual consistency
SearchFull-text search across all crawled pages
Site DiscoveryMulti-turn AI exploration of your site
AI Content DetectorCompare your content to AI-generated baselines
AI Crawler LogsTrack AI bot activity (requires Cloudflare)
Search ConsoleConversational GSC analysis (requires Search Console)

Requirements Summary

FeatureRequires
Core analysis, Intelligence, Link Analysis, AI toolsPro subscription + completed crawl
Topics, IssuesSite Intelligence pipeline complete
Search Console ExplorerGoogle Search Console connected
AI Crawler LogsCloudflare connected
Topic Coverage (keyword mode), Page Insights (performance data)Google Search Console connected