skuke.net
robots.txt
Robots Exclusion Standard data for skuke.net
Resource Scan
Scan Details
Site Domain | skuke.net |
Base Domain | skuke.net |
Scan Status | Ok |
Last Scan | 2024-11-15T06:59:49+00:00 |
Next Scan | 2024-11-22T06:59:49+00:00 |
Last Scan
Scanned | 2024-11-15T06:59:49+00:00 |
URL | https://skuke.net/robots.txt |
Domain IPs | 185.56.234.12 |
Response IP | 185.56.234.12 |
Found | Yes |
Hash | 4c25405c496bba1d1a9db7d8947c660e9db152fffeba06799c6ae843369f707b |
SimHash | 4c01c4514352 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /img/ |
Disallow | /css/ |
Disallow | /menu/ |
Disallow | /news?id=* |
Disallow | /to/ |
Allow | *.js |
Allow | *.css |
Other Records
Field | Value |
---|---|
sitemap | https://skuke.net/sitemap.xml |
sitemap | https://skuke.net/sitemap_latest.xml |
Warnings
- `clean-param` is not a known field.
- `host` is not a known field.