agh.edu.pl
robots.txt

Robots Exclusion Standard data for agh.edu.pl

Resource Scan

Scan Details

Site Domain agh.edu.pl
Base Domain agh.edu.pl
Scan Status Ok
Last Scan2024-09-24T11:47:15+00:00
Next Scan 2024-10-24T11:47:15+00:00

Last Scan

Scanned2024-09-24T11:47:15+00:00
URL https://agh.edu.pl/robots.txt
Redirect https://www.agh.edu.pl/robots.txt
Redirect Domain www.agh.edu.pl
Redirect Base agh.edu.pl
Domain IPs 149.156.96.150, 2001:6d8:10:118c::6096
Redirect IPs 149.156.96.150, 2001:6d8:10:118c::6096
Response IP 149.156.96.150
Found Yes
Hash d558dc2b59004309de8ea9506c5a2b0f93e7770e93d42d94365138bb42a47afb
SimHash a10043d61fe1

Groups

*

Rule Path Comment
Allow / -
Disallow /typo3/ -
Disallow /typo3conf/ -
Allow /typo3conf/ext/ -
Allow /typo3temp/ -
Disallow /*?id=* non speaking URLs
Disallow /*tx_powermail_pi1 no powermail thanks pages
Disallow /*tx_form_formframework no forms

Other Records

Field Value
sitemap https://www.agh.edu.pl/sitemap.xml?sitemap=pages&cHash=922faec96d94de9598b673896b3d9bbf

Comments

  • folders
  • parameters
  • Disallow: /*cHash # no cHash