extra.interia.pl
robots.txt

Robots Exclusion Standard data for extra.interia.pl

Resource Scan

Scan Details

Site Domain extra.interia.pl
Base Domain interia.pl
Scan Status Ok
Last Scan 2024-11-14T12:48:20+00:00
Next Scan 2024-11-21T12:48:20+00:00

Last Scan

Scanned 2024-11-14T12:48:20+00:00
URL https://extra.interia.pl/robots.txt
Domain IPs 217.74.71.147
Response IP 217.74.71.147
Found Yes
Hash d4d6af89458a0b6c2e4f312e1b62fc1a1213e6308016e0d7839f3b978303dcd6
SimHash 6e40d4a45b19

Groups

*

Rule Path
Disallow %2CspamId
Disallow %2CrepId
Disallow %2CsSort%2C1
Disallow /embed-video?
Disallow /ajax
Disallow /zglos-naduzycie/*
Disallow /ajax/zglos-naduzycie/*
Disallow /komentarze/odpowiedz/formularz
Disallow /komentarze/odpowiedz/wyslij
Disallow /udostepnij-komentarz
Disallow /key%3D*
Disallow /script
Disallow /y%3D*
Disallow /ad.js*
Disallow /ocen%2C*
Disallow *%2Cth%2C*
Disallow *%2CaddCForm%2C*
Disallow *%2Cs%2C*
Disallow /getVideoInfo
Disallow /embed-video
Disallow /logowanie
Disallow /rejestracja
Disallow */ankieta
Disallow */wyniki-ankiety-
Disallow /pokaz-komentarz%2CpId%2C%POST%*
Disallow /forum/post%2CpId%2C%POST_ID%*
Disallow /newsamp2-
Disallow /wpisamp2-
Disallow /przepisamp2-
Disallow /gwiazdaamp2-
Disallow /zdjecieamp2
Disallow /videoamp2
Disallow /*?*parametr=*
Disallow /*?*f=*
Disallow /adc
Disallow /emotions-api
Disallow /*spamId%3D
Disallow /*%2CspamId%2C
Disallow /*repId%3D
Disallow /*%2CrepId%2C
Disallow /*sSort%3D
Disallow /*%2CsSort%2C

mediapartners-google

Rule Path
Allow /
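
As a rough illustration of how the two groups above might be evaluated, here is a minimal Python sketch. It assumes the matching behaviour described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, and the longest matching rule wins, with Allow winning ties. The names `rule_to_regex`, `is_allowed`, `STAR_GROUP` and `MEDIAPARTNERS_GROUP` are hypothetical, only a handful of the listed rules are copied, and percent-encoding normalization is left out, so this is not a full parser.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" in place of each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins and Allow wins ties;
    a path with no matching rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if rule_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


# A subset of the "*" group listed above (not the full rule set).
STAR_GROUP = [
    ("disallow", "/ajax"),
    ("disallow", "/embed-video?"),
    ("disallow", "/logowanie"),
    ("disallow", "*/ankieta"),
    ("disallow", "/*?*f=*"),
]

# The mediapartners-google group consists of a single Allow rule.
MEDIAPARTNERS_GROUP = [("allow", "/")]

if __name__ == "__main__":
    for path in ["/index.html", "/ajax/foo", "/news/ankieta", "/list?f=1"]:
        print(path, is_allowed(path, STAR_GROUP), is_allowed(path, MEDIAPARTNERS_GROUP))
```

Under these assumptions, `/ajax/foo`, `/news/ankieta` and `/list?f=1` are disallowed for the `*` group, while every path is allowed for mediapartners-google. Real crawlers may differ in details such as percent-encoding handling.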