waysidepublishing.com
robots.txt

Robots Exclusion Standard data for waysidepublishing.com

Resource Scan

Scan Details

Site Domain waysidepublishing.com
Base Domain waysidepublishing.com
Scan Status Ok
Last Scan2026-02-21T01:11:21+00:00
Next Scan 2026-02-28T01:11:21+00:00

Last Scan

Scanned2026-02-21T01:11:21+00:00
URL https://waysidepublishing.com/robots.txt
Redirect https://www.waysidepublishing.com/robots.txt
Redirect Domain www.waysidepublishing.com
Redirect Base waysidepublishing.com
Domain IPs 52.20.147.12
Redirect IPs 104.20.22.22, 172.66.165.58, 2606:4700:10::6814:1616, 2606:4700:10::ac42:a53a
Response IP 172.66.165.58
Found Yes
Hash b592408a7e5d95c86938a868fd87e71a2006ca233701757b8e21556fb3f614a5
SimHash 41501d523592

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://www.waysidepublishing.com/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.waysidepublishing.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/