gov.edu.pl
robots.txt

Robots Exclusion Standard data for gov.edu.pl

Resource Scan

Scan Details

Site Domain gov.edu.pl
Base Domain gov.edu.pl
Scan Status Ok
Last Scan2026-01-24T22:41:28+00:00
Next Scan 2026-01-31T22:41:28+00:00

Last Scan

Scanned2026-01-24T22:41:28+00:00
URL https://gov.edu.pl/robots.txt
Domain IPs 104.21.37.180, 172.67.211.68, 2606:4700:3032::6815:25b4, 2606:4700:3036::ac43:d344
Response IP 104.21.37.180
Found Yes
Hash 2b9575d8e947c6626827943183c28f7fc4b54b434f5510ec1d1a6b12896422d7
SimHash 6035bb70808d

Groups

*

Rule Path
Allow /
Disallow /assets/
Disallow /styles/
Disallow /user/delete_cookies
Disallow /user/forgot_password
Disallow /memberlist.php
Disallow /search.php

applebot
awariorssbot
baiduspider
claudebot
gptbot
iboubot
meta-externalagent
meta-webindexer
petalbot
seekportbot
serankingbacklinksbot
terracotta

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gov.edu.pl/sitemap.xml