georgeinthestrand.com
robots.txt

Robots Exclusion Standard data for georgeinthestrand.com

Resource Scan

Scan Details

Site Domain georgeinthestrand.com
Base Domain georgeinthestrand.com
Scan Status Ok
Last Scan2024-11-13T14:57:39+00:00
Next Scan 2024-12-13T14:57:39+00:00

Last Scan

Scanned2024-11-13T14:57:39+00:00
URL https://www.georgeinthestrand.com/robots.txt
Domain IPs 2600:1413:b000:6::17d5:2bca, 2600:1413:b000:6::17d5:2bdd, 96.17.96.19, 96.17.96.20
Response IP 23.200.218.115
Found Yes
Hash 02b6b60eebba970715b11db2fd1a55b96e9283b362fbe0eaa8a43b08e177e720
SimHash 1b075b654f51

Groups

*

Rule Path
Disallow /App_Data/
Disallow /masterpages/
Disallow /bin/
Disallow /config/
Disallow /css/
Disallow /data/
Disallow /js/
Disallow /images/
Disallow /includes/
Disallow /media/
Disallow /Properties/
Disallow /scripts/
Disallow /sitecore/
Disallow */sitecore/*
Disallow /usercontrols/
Disallow /xslt/
Disallow /Web.config

screaming frog seo spider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.georgeinthestrand.com/sitemap.xml