pl-gazetki.com
robots.txt
Robots Exclusion Standard data for pl-gazetki.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | pl-gazetki.com |
Base Domain | pl-gazetki.com |
Scan Status | Ok |
Last Scan | 2024-11-18T00:04:59+00:00 |
Next Scan | 2024-11-25T00:04:59+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-18T00:04:59+00:00 |
URL | https://pl-gazetki.com/robots.txt |
Domain IPs | 104.21.55.102, 172.67.147.118, 2606:4700:3031::6815:3766, 2606:4700:3036::ac43:9376 |
Response IP | 104.21.55.102 |
Found | Yes |
Hash | 6bc8f11f0b560ddf50ac66e114b76afd0908bb6019be528a7f7f0a64d08920a5 |
SimHash | 24455c3089d3 |
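The report does not name the algorithm behind the Hash field. Its 64 hexadecimal digits are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. That is an assumption, as is whether the scanner normalizes the body before hashing. The function names `body_digest` and `matches_report` are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a fetched robots.txt body.

    Assumption: the report's Hash field is SHA-256 of the raw bytes.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported_hash: str) -> bool:
    """Compare a locally computed digest with the hash shown in a report."""
    return body_digest(body) == reported_hash.strip().lower()
```

A mismatch against a freshly fetched file would not prove the file changed; it could also mean the assumed algorithm or input bytes differ from what the scanner uses.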
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /admin |
Disallow | /n/* |
Disallow | *taghash* |
Disallow | *cutterhash* |
Disallow | *open-street-map* |
Disallow | /user* |
Disallow | *?*hash=* |
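Several of these paths use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of wildcard matching as commonly implemented by crawlers (`*` matches any character sequence, a trailing `$` anchors the end, everything else is a prefix match). The helper names and the sample paths in the usage note are hypothetical. Allow rules and longest-match precedence are omitted because this group contains only Disallow rules.

```python
import re

# Disallow paths copied from the table above.
DISALLOW_PATTERNS = [
    "/admin",
    "/n/*",
    "*taghash*",
    "*cutterhash*",
    "*open-street-map*",
    "/user*",
    "*?*hash=*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal segments and join them with ".*" for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if the path (including any query string) hits a rule."""
    # re.match anchors at the start, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW_PATTERNS)
```

Under these rules, a path such as `/n/123` or `/page?x=1&hash=abc` would be blocked, while `/news` or `/page?x=1` would not. Note that `/admin` is a prefix rule, so it also covers `/administrator`.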
Other Records
Field | Value |
---|---|
sitemap | https://pl-gazetki.com/sitemap-index.xml |
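For readers who want the parsed records back in file form, the sketch below renders the group and the sitemap record as robots.txt text. It is an approximation only: the original file's ordering, casing, comments, and whitespace are not recoverable from this report, so the output should not be expected to reproduce the reported hash. The names `GROUPS`, `OTHER_RECORDS`, and `render_robots_txt` are hypothetical.

```python
# Parsed records transcribed from the report's Groups and Other Records tables.
GROUPS = {
    "*": [
        ("Disallow", "/admin"),
        ("Disallow", "/n/*"),
        ("Disallow", "*taghash*"),
        ("Disallow", "*cutterhash*"),
        ("Disallow", "*open-street-map*"),
        ("Disallow", "/user*"),
        ("Disallow", "*?*hash=*"),
    ],
}
OTHER_RECORDS = [("sitemap", "https://pl-gazetki.com/sitemap-index.xml")]


def render_robots_txt(groups, other_records) -> str:
    """Render parsed groups and records as approximate robots.txt text."""
    lines = []
    for user_agent, rules in groups.items():
        lines.append(f"User-agent: {user_agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line separates groups
    lines.extend(f"{field.capitalize()}: {value}" for field, value in other_records)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUPS, OTHER_RECORDS), end="")
```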