greatcleanhome.com
robots.txt
Robots Exclusion Standard data for greatcleanhome.com
Resource Scan
Scan Details
Site Domain | greatcleanhome.com |
Base Domain | greatcleanhome.com |
Scan Status | Ok |
Last Scan | 2024-09-24T10:16:23+00:00 |
Next Scan | 2024-10-01T10:16:23+00:00 |
Last Scan
Scanned | 2024-09-24T10:16:23+00:00 |
URL | https://greatcleanhome.com/robots.txt |
Domain IPs | 104.21.87.176, 172.67.170.124, 2606:4700:3033::6815:57b0, 2606:4700:3035::ac43:aa7c |
Response IP | 104.21.87.176 |
Found | Yes |
Hash | cdc9911790c5eae2f4a57a3e890ac3c52ccf343099bec98f23c7d5e97b41e7f8 |
SimHash | 63005840cbb2 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /wp-login.php |
Disallow | /goto/ |
Allow | /wp-admin/admin-ajax.php |
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
Other Records
Field | Value |
---|---|
sitemap | https://greatcleanhome.com/sitemap_index.xml |