georgeinthestrand.com
robots.txt

Robots Exclusion Standard data for georgeinthestrand.com

Resource Scan

Scan Details

Site Domain georgeinthestrand.com
Base Domain georgeinthestrand.com
Scan Status Ok
Last Scan2024-06-16T14:56:27+00:00
Next Scan 2024-07-16T14:56:27+00:00

Last Scan

Scanned2024-06-16T14:56:27+00:00
URL https://www.georgeinthestrand.com/robots.txt
Domain IPs 23.44.4.163, 23.44.4.171, 2600:1413:a000::1735:218a, 2600:1413:a000::1735:2193
Response IP 23.44.4.171
Found Yes
Hash 02b6b60eebba970715b11db2fd1a55b96e9283b362fbe0eaa8a43b08e177e720
SimHash 1b075b654f51

Groups

*

Rule Path
Disallow /App_Data/
Disallow /masterpages/
Disallow /bin/
Disallow /config/
Disallow /css/
Disallow /data/
Disallow /js/
Disallow /images/
Disallow /includes/
Disallow /media/
Disallow /Properties/
Disallow /scripts/
Disallow /sitecore/
Disallow */sitecore/*
Disallow /usercontrols/
Disallow /xslt/
Disallow /Web.config

screaming frog seo spider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.georgeinthestrand.com/sitemap.xml