caprocat.com
robots.txt
Robots Exclusion Standard data for caprocat.com
Resource Scan
Scan Details
Site Domain | caprocat.com |
Base Domain | caprocat.com |
Scan Status | Ok |
Last Scan | 2024-05-29T03:04:10+00:00 |
Next Scan | 2024-06-28T03:04:10+00:00 |
Last Scan
Scanned | 2024-05-29T03:04:10+00:00 |
URL | https://caprocat.com/robots.txt |
Domain IPs | 18.161.6.23, 18.161.6.61, 18.161.6.82, 18.161.6.98 |
Response IP | 108.157.52.119 |
Found | Yes |
Hash | bdd072c4f9251f3e77f8b72ebcda4ef93946dd6bf53e2331a26e07b681f3b02c |
SimHash | 6b0c4878c338 |
Groups
*
Rule | Path |
---|---|
Allow | /*.css$ |
Allow | /*.js$ |
Disallow | /*/?slug=* |
Disallow | /*/*/?slug=* |
Disallow | /wp-login |
Disallow | /*/feed/ |
Disallow | /*/trackback/ |
Disallow | /*/attachment/ |
Disallow | /?attachment_id* |
Disallow | /comments/ |
Disallow | /xmlrpc.php |
Disallow | /*?s= |
Disallow | /?s=* |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
Other Records
Field | Value |
---|---|
sitemap | https://caprocat.com/sitemap_index.xml |