cps-check.com
robots.txt
Robots Exclusion Standard data for cps-check.com
Resource Scan
Scan Details
Site Domain | cps-check.com |
Base Domain | cps-check.com |
Scan Status | Ok |
Last Scan | 2024-09-21T07:39:47+00:00 |
Next Scan | 2024-09-28T07:39:47+00:00 |
Last Scan
Scanned | 2024-09-21T07:39:47+00:00 |
URL | https://cps-check.com/robots.txt |
Domain IPs | 104.21.8.15, 172.67.156.160, 2606:4700:3031::ac43:9ca0, 2606:4700:3034::6815:80f |
Response IP | 104.21.8.15 |
Found | Yes |
Hash | 134524fdc113ced84f36c42b9e5c22e96fe6dfc7b5b2fc109430d856cd04999a |
SimHash | 4508dcd09791 |
Groups
yandex
Rule | Path |
---|---|
Disallow | *.html?* |
Disallow | *? |
Disallow | /js |
Disallow | /img |
Disallow | /css |
Disallow | /memes |
Disallow | /cdn-cgi/ |
Disallow | /privacy |
Other Records
Field | Value |
---|---|
crawl-delay | 2 |
*
Rule | Path |
---|---|
Disallow | *.html?* |
Disallow | *? |
Disallow | /js |
Disallow | /img |
Disallow | /css |
Disallow | /memes |
Disallow | /cdn-cgi/ |
Disallow | /privacy |
Other Records
Field | Value |
---|---|
sitemap | https://cps-check.com/sitemap.xml |
Warnings
- `host` is not a known field.