thenatureofhome.com
robots.txt

Robots Exclusion Standard data for thenatureofhome.com

Resource Scan

Scan Details

Site Domain thenatureofhome.com
Base Domain thenatureofhome.com
Scan Status Ok
Last Scan2024-09-22T08:39:38+00:00
Next Scan 2024-09-29T08:39:38+00:00

Last Scan

Scanned2024-09-22T08:39:38+00:00
URL https://thenatureofhome.com/robots.txt
Domain IPs 104.16.150.108, 104.16.151.108, 2606:4700::6810:966c, 2606:4700::6810:976c
Response IP 104.16.151.108
Found Yes
Hash 75211a8a330ce766198d3fe57152464a8532670d14858ce9d07b00748ff8860f
SimHash 595d5900eb72

Groups

*

Rule Path
Allow /
Disallow /wp-admin/
Disallow /readme.html

anthropicbot

Rule Path
Disallow /

claude

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

webpilotbot

Rule Path
Disallow /

httrack

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

curl

Rule Path
Disallow /

wget

Rule Path
Disallow /

Other Records

Field Value
sitemap https://thenatureofhome.com/post-sitemap.xml
sitemap https://thenatureofhome.com/page-sitemap.xml
sitemap https://thenatureofhome.com/sitemap_index.xml