ssceindhoven.tue.nl
robots.txt

Robots Exclusion Standard data for ssceindhoven.tue.nl

Resource Scan

Scan Details

Site Domain ssceindhoven.tue.nl
Base Domain tue.nl
Scan Status Ok
Last Scan2024-11-07T08:50:36+00:00
Next Scan 2024-11-21T08:50:36+00:00

Last Scan

Scanned2024-11-07T08:50:36+00:00
URL https://ssceindhoven.tue.nl/robots.txt
Domain IPs 167.235.218.203, 2a01:4f8:c011:eec::1
Response IP 167.235.218.203
Found Yes
Hash 66b457f0e9574f85fc23dd8afd6ad065ec325143730227e8a322196517e28153
SimHash 6c0c4fc36a71

Groups

*

Rule Path
Allow /
Disallow /t3lib/
Disallow /typo3/
Disallow /typo3conf/
Allow /typo3conf/ext/*/Resources/Public/
Disallow /typo3temp/
Allow /typo3temp/compressor/
Allow /typo3temp/pics/
Disallow /storage/
Disallow /search-results/*
Disallow /*?no_cache=*
Disallow /*?type=*
Disallow /*?search=*
Disallow /*?filter=*
Disallow /*?id=*

Other Records

Field Value
crawl-delay 3

fasterfox

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

wget

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

Comments

  • Sitemap: http://www.example.tld/sitemap.xml # Your Sitemap
  • Sitemap: http://www.example.tld/rss.xml # Your RSS Feed