twoja-gazetka.pl
robots.txt
Robots Exclusion Standard data for twoja-gazetka.pl
Resource Scan
Scan Details
Site Domain | twoja-gazetka.pl |
Base Domain | twoja-gazetka.pl |
Scan Status | Ok |
Last Scan | 2024-10-05T12:52:13+00:00 |
Next Scan | 2024-10-12T12:52:13+00:00 |
Last Scan
Scanned | 2024-10-05T12:52:13+00:00 |
URL | https://twoja-gazetka.pl/robots.txt |
Domain IPs | 104.26.12.195, 104.26.13.195, 172.67.74.250, 2606:4700:20::681a:cc3, 2606:4700:20::681a:dc3, 2606:4700:20::ac43:4afa |
Response IP | 104.26.12.195 |
Found | Yes |
Hash | 485403e6cd6e55f3d27feee2272e5556176adb1c65f0cbd0647dfaacb56caca1 |
SimHash | 24455cb2ab53 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin |
Disallow | /n/* |
Disallow | *taghash* |
Disallow | *cutterhash* |
Disallow | *open-street-map* |
Disallow | /user* |
Disallow | *?*hash=* |
Other Records
Field | Value |
---|---|
sitemap | https://twoja-gazetka.pl/sitemap-index.xml |