
Robots Exclusion Standard data for top.pl

Resource Scan

Scan Details

Site Domain top.pl
Base Domain top.pl
Scan Status Ok
Last Scan 2024-06-02T15:51:31+00:00
Next Scan 2024-06-09T15:51:31+00:00

Last Scan

Scanned 2024-06-02T15:51:31+00:00
URL https://top.pl/robots.txt
Domain IPs 217.74.71.147
Response IP 217.74.71.147
Found Yes
Hash 19838b7f5470ab65441ad0fb209e4996eebfde82111aab6e79e24ad113af0b31
SimHash 6f0494a41b19

Groups

*

Rule Path
Disallow %2CspamId
Disallow %2CrepId
Disallow %2CsSort%2C1
Disallow /embed-video?
Disallow /ajax
Disallow /j/common
Disallow /zglos-naduzycie/*
Disallow /ajax/zglos-naduzycie/*
Disallow /komentarze/odpowiedz/formularz
Disallow /komentarze/odpowiedz/wyslij
Disallow /udostepnij-komentarz
Disallow /key%3D*
Disallow /script
Disallow /y%3D*
Disallow /ad.js*
Disallow /ocen%2C*
Disallow *%2Cth%2C*
Disallow *%2CaddCForm%2C*
Disallow *%2Cs%2C*
Disallow /getVideoInfo
Disallow /embed-video
Disallow /logowanie
Disallow /rejestracja
Disallow */ankieta
Disallow */wyniki-ankiety-
Disallow /pokaz-komentarz%2CpId%2C%POST%*
Disallow /forum/post%2CpId%2C%POST_ID%*
Disallow /newsamp2-
Disallow /wpisamp2-
Disallow /przepisamp2-
Disallow /gwiazdaamp2-
Disallow /zdjecieamp2
Disallow /videoamp2
Disallow /*?*parametr=*
Disallow /*?*f=*
Disallow /adc
Disallow /emotions-api
Disallow /*spamId%3D
Disallow /*%2CspamId%2C
Disallow /*repId%3D
Disallow /*%2CrepId%2C
Disallow /*sSort%3D
Disallow /*%2CsSort%2C

mediapartners-google

Rule Path
Allow /
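
To illustrate how rules like the ones in the groups above are typically evaluated, here is a minimal Python sketch. It assumes the matching behaviour described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `pattern_to_regex`, `is_allowed` and `SAMPLE_RULES` are hypothetical, only a handful of the listed rules are copied in, and percent-encoding normalization is deliberately left out, so this is not a full parser.

```python
import re

def pattern_to_regex(path_pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else is matched literally, starting at the first character.
    """
    anchored = path_pattern.endswith("$")
    core = path_pattern[:-1] if anchored else path_pattern
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_allowed(rules, path):
    """Return True if `path` may be crawled under `rules`.

    `rules` is a list of (kind, pattern) tuples, kind being "allow" or
    "disallow". The longest matching pattern decides; Allow wins ties.
    A path matched by no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                best_allow = (kind == "allow")
    return best_allow

# A few rules copied from the "*" group of the listing (not the full set).
SAMPLE_RULES = [
    ("disallow", "/ajax"),
    ("disallow", "/logowanie"),
    ("disallow", "/*?*f=*"),
    ("disallow", "*/ankieta"),
    ("disallow", "%2CspamId"),
]

# The mediapartners-google group from the listing: a single Allow rule.
MEDIAPARTNERS_RULES = [("allow", "/")]

if __name__ == "__main__":
    # The example paths below are made up for illustration.
    for path in ["/ajax/foo", "/wiadomosci", "/news?f=1", "/sonda/ankieta"]:
        print(path, is_allowed(SAMPLE_RULES, path))
    print("/ajax", is_allowed(MEDIAPARTNERS_RULES, "/ajax"))
```

In this sketch a pattern that begins with neither `/` nor `*`, such as `%2CspamId`, can never match, because every URL path starts with `/` and matching begins at the first character. Real crawlers may treat such rules differently.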

Other Records

Field Value
sitemap https://top.pl/sitemap/top.pl/top.pl-wiadomosci.xml.gz
sitemap https://top.pl/sitemap/top.pl-siteindex-2.xml.gz
sitemap https://top.pl/sitemap/top.pl-siteindex-1.xml.gz
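
The sitemap records above come from `Sitemap:` lines in the robots.txt file. A minimal sketch of how such lines could be collected from a robots.txt body follows. The function name `extract_sitemaps` and the sample body are hypothetical; the sample is a short reconstruction built from records in this listing, not the actual file contents.

```python
from typing import List

def extract_sitemaps(robots_body: str) -> List[str]:
    """Collect the values of all 'sitemap' fields in a robots.txt body.

    Field names are compared case-insensitively. Text after '#' is treated
    as a comment, which would also cut off a URL fragment if one were present.
    """
    sitemaps = []
    for raw_line in robots_body.splitlines():
        line = raw_line.split("#", 1)[0].strip()
        if ":" not in line:
            continue
        # Split only on the first colon so the URL's own colons survive.
        field, value = line.split(":", 1)
        if field.strip().lower() == "sitemap":
            sitemaps.append(value.strip())
    return sitemaps

# Hypothetical sample body assembled from records shown in the listing.
SAMPLE_BODY = """\
User-agent: *
Disallow: /ajax

User-agent: mediapartners-google
Allow: /

Sitemap: https://top.pl/sitemap/top.pl/top.pl-wiadomosci.xml.gz
sitemap: https://top.pl/sitemap/top.pl-siteindex-2.xml.gz
"""

if __name__ == "__main__":
    for url in extract_sitemaps(SAMPLE_BODY):
        print(url)
```

Sitemap records are independent of user-agent groups, so the sketch scans the whole body rather than a single group.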