urbansurvival.com
robots.txt

Robots Exclusion Standard data for urbansurvival.com

Resource Scan

Scan Details

Site Domain urbansurvival.com
Base Domain urbansurvival.com
Scan Status Ok
Last Scan2026-02-24T13:50:59+00:00
Next Scan 2026-03-03T13:50:59+00:00

Last Scan

Scanned2026-02-24T13:50:59+00:00
URL https://urbansurvival.com/robots.txt
Domain IPs 190.92.156.129
Response IP 190.92.156.129
Found Yes
Hash bbf362ab1b61f8268fd6e8277612b4fec15cd6d02dd15830f2a21fc029be221f
SimHash 0b1c8c128cb7

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /

gptbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

claudebot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

xai-crawler

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

commoncrawl

Rule Path
Allow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://urbansurvival.com/sitemap_index.xml

Comments

  • Ure oh so very clever - whatchu doing sniffing my site, Weasel?
  • ======================================================
  • Global rules – applies to all unspecified bots
  • ======================================================
  • ======================================================
  • Truth-seeking AI crawlers – explicit overrides
  • ======================================================
  • ======================================================
  • Hostile scrapers – explicit denial
  • ======================================================