newspasky.ru
robots.txt
Robots Exclusion Standard data for newspasky.ru
Resource Scan
Scan Details
Site Domain | newspasky.ru |
Base Domain | newspasky.ru |
Scan Status | Ok |
Last Scan | 2024-10-01T11:55:06+00:00 |
Next Scan | 2024-10-08T11:55:06+00:00 |
Last Scan
Scanned | 2024-10-01T11:55:06+00:00 |
URL | https://newspasky.ru/robots.txt |
Domain IPs | 104.21.27.161, 172.67.143.62, 2606:4700:3031::6815:1ba1, 2606:4700:3037::ac43:8f3e |
Response IP | 172.67.143.62 |
Found | Yes |
Hash | af6212248a64c80eee29ccae5e7aabdc107e81a84dd5ec6d8dc6ebb6888ee8e1 |
SimHash | 408d9c50e133 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /plugins/ |
Disallow | /search/ |
Disallow | /cart/ |
Disallow | */?s= |
Disallow | *sort%3D |
Disallow | *view%3D |
Disallow | *utm%3D |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
Other Records
Field | Value |
---|---|
sitemap | https://newspasky.ru/sitemap.xml |
Warnings
- `clean-param` is not a known field.
- `host` is not a known field.