k12workbook.com
robots.txt

Robots Exclusion Standard data for k12workbook.com

Resource Scan

Scan Details

Site Domain k12workbook.com
Base Domain k12workbook.com
Scan Status Ok
Last Scan2025-10-03T08:50:35+00:00
Next Scan 2025-10-10T08:50:35+00:00

Last Scan

Scanned2025-10-03T08:50:35+00:00
URL https://k12workbook.com/robots.txt
Domain IPs 216.137.182.105
Response IP 216.137.182.105
Found Yes
Hash f44ad66bfae33171fc3878b80cb79996c0410cacf272f5fcbdc55978eef09c1b
SimHash b20e1d5887f7

Groups

*

Rule Path
Disallow /client/
Disallow /includes/
Disallow /data/

Other Records

Field Value
sitemap https://k12workbook.com/sitemap-index.xml

Comments

  • If the site is installed within a folder such as at
  • e.g. www.example.com/site/ the robots.txt file MUST be
  • moved to the site root at e.g. www.example.com/robots.txt
  • AND the folder name MUST be prefixed to the disallowed
  • path, e.g. the Disallow rule for the /administrator/ folder
  • MUST be changed to read Disallow: /site/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml