theplancollection.com
robots.txt

Robots Exclusion Standard data for theplancollection.com

Resource Scan

Scan Details

Site Domain theplancollection.com
Base Domain theplancollection.com
Scan Status Ok
Last Scan2025-11-27T14:04:22+00:00
Next Scan 2025-12-04T14:04:22+00:00

Last Scan

Scanned2025-11-27T14:04:22+00:00
URL https://theplancollection.com/robots.txt
Redirect https://www.theplancollection.com/robots.txt
Redirect Domain www.theplancollection.com
Redirect Base theplancollection.com
Domain IPs 216.150.1.1
Redirect IPs 216.150.1.1, 216.150.16.1
Response IP 216.150.1.1
Found Yes
Hash 5de16e6e655ee30a50a0c23597efab65cb45805eb751a8122632a9776fdb1aba
SimHash 68500a028b13

Groups

*

Rule Path
Disallow /EmailTemplates
Disallow /SearchResultsTextFiles
Disallow /*.asp
Disallow /*.aspx
Disallow /*rss$
Disallow /*?rawUrl=*
Disallow /*?rawurl=*
Disallow /*save$
Disallow /house-plans/home-plan-*/print
Disallow /house-plans/modify-plan-*
Disallow /admin/
Disallow /admin/CKeditorUploads/Images/
Disallow /login
Disallow /my-account
Disallow /recover-password
Disallow /search-results
Disallow /my-shopping-cart
Disallow /*atom
Disallow */undefined

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

claudebot

Rule Path
Allow /

claude-user

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

yandex

Rule Path
Disallow /bundles*

Other Records

Field Value
sitemap https://www.theplancollection.com/sitemap.xml