willabelmont.nocujmy.pl
robots.txt

Robots Exclusion Standard data for willabelmont.nocujmy.pl

Resource Scan

Scan Details

Site Domain willabelmont.nocujmy.pl
Base Domain nocujmy.pl
Scan Status Ok
Last Scan2025-09-18T21:32:29+00:00
Next Scan 2025-10-18T21:32:29+00:00

Last Scan

Scanned2025-09-18T21:32:29+00:00
URL https://willabelmont.nocujmy.pl/robots.txt
Domain IPs 104.21.43.187, 172.67.184.75, 2606:4700:3035::ac43:b84b, 2606:4700:3037::6815:2bbb
Response IP 104.21.43.187
Found Yes
Hash 6c5f2d57f51de34a0cfc6b3fa7d1fb65b5edceaf5960bcbd686e2d29619d9e12
SimHash 4a05ca4666b2

Groups

*

Rule Path
Allow /

yandex

Rule Path
Disallow /regulamin/
Disallow /cookies/

Other Records

Field Value
crawl-delay 5

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

yacybot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

lexxebot

Rule Path
Disallow /

psbot/0.1

Rule Path
Disallow /

psbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

e-societyrobot

Rule Path
Disallow /

tmcrawler

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

Other Records

Field Value
sitemap /sitemap-categories.xml
sitemap /sitemap-objects.xml