thecurriculumchoice.com
robots.txt
Robots Exclusion Standard data for thecurriculumchoice.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | thecurriculumchoice.com |
Base Domain | thecurriculumchoice.com |
Scan Status | Ok |
Last Scan | 2024-09-20T10:02:27+00:00 |
Next Scan | 2024-09-27T10:02:27+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-20T10:02:27+00:00 |
URL | https://thecurriculumchoice.com/robots.txt |
Domain IPs | 104.21.12.54, 172.67.193.175, 2606:4700:3030::6815:c36, 2606:4700:3035::ac43:c1af |
Response IP | 104.21.12.54 |
Found | Yes |
Hash | b2d8568501a4cdd68a007a64bd3b7cf367705ce7b5ebe9c60170448e2511345a |
SimHash | fe08dc4508a1 |
Groups
*
No rules defined in this group. All paths are allowed by this group.
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
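Crawl-delay is a non-standard field that is not part of RFC 9309; crawlers that honor it commonly read the value as a number of seconds to wait between requests. The following is a minimal sketch of a fetch loop that respects the reported value of 20 under that assumption. The names `polite_fetch_all`, `fetch`, and `sleep` are hypothetical and chosen for illustration only.

```python
import time

# Assumption: the crawl-delay value from the report is interpreted as seconds.
CRAWL_DELAY_SECONDS = 20


def polite_fetch_all(urls, fetch, delay=CRAWL_DELAY_SECONDS, sleep=time.sleep):
    """Call fetch(url) for each URL, pausing `delay` seconds between requests.

    `fetch` is any callable supplied by the caller (hypothetical here);
    `sleep` is injectable so the loop can be exercised without real waiting.
    """
    results = []
    for index, url in enumerate(urls):
        if index > 0:
            # Wait before every request except the first one.
            sleep(delay)
        results.append(fetch(url))
    return results
```

Passing a recording function as `sleep` makes it possible to check the pauses without actually waiting 20 seconds per request.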
*
Rule | Path |
---|---|
Disallow | /cgi-bin |
Disallow | /wp-login.php |
Disallow | /xmlrpc.php |
*
Rule | Path |
---|---|
Disallow | /*.doc$ |
Disallow | /*.pdf$ |
Disallow | /*.zip$ |
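The Disallow paths listed for the `*` groups can be checked against a URL path with a small matcher. This is a sketch, assuming the RFC 9309 conventions that `*` matches any sequence of characters, a trailing `$` anchors the end of the path, every other pattern is a prefix match, and multiple groups for the same user agent are combined. The helper names `pattern_to_regex` and `is_allowed` are hypothetical. Because the report lists no Allow rules, the longest-match precedence between Allow and Disallow is left out.

```python
import re

# Disallow paths copied from the "*" groups in this report,
# combined into one list (assumption: groups for the same agent are merged).
DISALLOW = [
    "/cgi-bin",
    "/wp-login.php",
    "/xmlrpc.php",
    "/*.doc$",
    "/*.pdf$",
    "/*.zip$",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regular expression."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text and let each "*" match any run of characters.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += r"\Z"  # "$" in the pattern means the path must end here
    return re.compile(regex)


def is_allowed(path, disallow=DISALLOW):
    """Return True when no Disallow pattern matches the start of the path."""
    # re.match anchors at the beginning, which gives the prefix semantics.
    return not any(pattern_to_regex(p).match(path) for p in disallow)
```

Under these assumptions, `/files/guide.pdf` is blocked, while `/files/guide.pdf?download=1` is not, because the `$` anchor requires the path to end in `.pdf`.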
Other Records
Field | Value |
---|---|
sitemap | https://www.thecurriculumchoice.com/sitemap_index.xml |
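To show how the parsed groups and records above map back to a robots.txt file, the sketch below renders them as text. It is an approximation only: the report does not show the original file's comments, blank lines, or field capitalization, so the layout produced here is an assumption. The function `render_robots_txt` and the `GROUPS` and `SITEMAPS` structures are hypothetical names introduced for illustration.

```python
# Groups and records as listed in this report, in report order.
GROUPS = [
    {"agent": "*", "records": [("Crawl-delay", "20")], "disallow": []},
    {"agent": "*", "records": [],
     "disallow": ["/cgi-bin", "/wp-login.php", "/xmlrpc.php"]},
    {"agent": "*", "records": [],
     "disallow": ["/*.doc$", "/*.pdf$", "/*.zip$"]},
]
SITEMAPS = ["https://www.thecurriculumchoice.com/sitemap_index.xml"]


def render_robots_txt(groups, sitemaps):
    """Render parsed groups as robots.txt text (layout is an assumption)."""
    lines = []
    for group in groups:
        lines.append(f"User-agent: {group['agent']}")
        for field, value in group["records"]:
            lines.append(f"{field}: {value}")
        for path in group["disallow"]:
            lines.append(f"Disallow: {path}")
        lines.append("")  # blank line between groups
    for url in sitemaps:
        lines.append(f"Sitemap: {url}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUPS, SITEMAPS))
```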