gazetki.pl
robots.txt

Robots Exclusion Standard data for gazetki.pl

Resource Scan

Scan Details

Site Domain gazetki.pl
Base Domain gazetki.pl
Scan Status Ok
Last Scan2024-09-24T10:56:28+00:00
Next Scan 2024-10-01T10:56:28+00:00

Last Scan

Scanned2024-09-24T10:56:28+00:00
URL https://gazetki.pl/robots.txt
Redirect https://www.gazetki.pl/robots.txt
Redirect Domain www.gazetki.pl
Redirect Base gazetki.pl
Domain IPs 104.21.82.173, 172.67.160.62, 2606:4700:3030::ac43:a03e, 2606:4700:3037::6815:52ad
Redirect IPs 13.33.88.32, 13.33.88.39, 13.33.88.41, 13.33.88.46, 2600:9000:223b:5400:d:5a50:9400:93a1, 2600:9000:223b:6400:d:5a50:9400:93a1, 2600:9000:223b:6600:d:5a50:9400:93a1, 2600:9000:223b:7200:d:5a50:9400:93a1, 2600:9000:223b:9400:d:5a50:9400:93a1, 2600:9000:223b:a800:d:5a50:9400:93a1, 2600:9000:223b:aa00:d:5a50:9400:93a1, 2600:9000:223b:ea00:d:5a50:9400:93a1
Response IP 13.33.88.46
Found Yes
Hash 2d4a60636b4aaea6138dd9b8f3ba26155cb5e94c99249ba695773e58175bdf29
SimHash 6d43ef226539

Groups

*

Rule Path
Allow /
Disallow /wyszukaj/*
Disallow /shopping-list/
Disallow /click-out/
Disallow /index.php/click-out/
Disallow /admin/
Disallow /cdn-cgi/
Disallow /oferty/*?sort=relevance&page=*&*
Allow /oferty/*?sort=relevance&page=*
Disallow /oferty/*?page=*&*
Allow /oferty/*?page=*
Disallow /oferty/*?*
Disallow /marki/*?page=*&*
Allow /marki/*?page=*
Disallow /marki/*?*
Disallow /sklepy/*?page=*&*
Allow /sklepy/*?page=*
Disallow /sklepy/*?*
Disallow /miasta/*?page=*&*
Allow /miasta/*?page=*
Disallow /miasta/*?*
Disallow /najnowsze-gazetki?page=*&*
Allow /najnowsze-gazetki?page=*
Disallow /najnowsze-gazetki?*
Disallow /gazetki-ktorych-waznosc-prawie-uplynela?page=*&*
Allow /gazetki-ktorych-waznosc-prawie-uplynela?page=*
Disallow /gazetki-ktorych-waznosc-prawie-uplynela?*
Disallow /popularne-gazetki?page=*&*
Allow /popularne-gazetki?page=*
Disallow /popularne-gazetki?*
Disallow /polecane-gazetki?page=*&*
Allow /polecane-gazetki?page=*
Disallow /polecane-gazetki?*
Disallow /najnowsze-oferty?page=*&*
Allow /najnowsze-oferty?page=*
Disallow /najnowsze-oferty?*
Disallow /prawie-wygasle-oferty?page=*&*
Allow /prawie-wygasle-oferty?page=*
Disallow /prawie-wygasle-oferty?*
Disallow /najpopularniejsze-oferty?page=*&*
Allow /najpopularniejsze-oferty?page=*
Disallow /najpopularniejsze-oferty?*
Disallow /polecane-oferty?page=*&*
Allow /polecane-oferty?page=*
Disallow /polecane-oferty?*

adsbot-google

Rule Path
Disallow /click-out/
Disallow /admin/

*
adsbot-google

Rule Path
Disallow /cdn-cgi/bm/cv/
Disallow /cdn-cgi/challenge-platform/
Disallow /cdn-cgi/images/trace/
Disallow /cdn-cgi/rum
Disallow /cdn-cgi/scripts/
Disallow /cdn-cgi/styles/
Disallow /cdn-cgi/zaraz/

nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler
mj12bot
ahrefsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.gazetki.pl/sitemap.xml