hs-wismar.de
robots.txt

Robots Exclusion Standard data for hs-wismar.de

Resource Scan

Scan Details

Site Domain hs-wismar.de
Base Domain hs-wismar.de
Scan Status Ok
Last Scan2024-10-29T00:03:05+00:00
Next Scan 2024-11-28T00:03:05+00:00

Last Scan

Scanned2024-10-29T00:03:05+00:00
URL https://hs-wismar.de/robots.txt
Domain IPs 141.53.15.120
Response IP 141.53.15.120
Found Yes
Hash 54c421189b58e95b4d54c8bc54d9a60a8e0d3ae5664846f709c177ad474f7dc5
SimHash 6004db4365e2

Groups

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow /bin/
Disallow /fileadmin/
Disallow /storages/protected/
Disallow /Packages/
Disallow /typo3/
Disallow /typo3conf/
Disallow /typo3_src/
Disallow /uploads/
Disallow /*?id=*
Disallow /*%26id%3D*
Allow /typo3conf/ext/
Allow /fileadmin/_processed_/
Allow /storages/hs-wismar/

Comments

  • robots.txt für HS Wismar
  • Disallow crawling at all ...
  • Disallow: /
  • Disallow folders
  • Disallow for non-realurl URLs
  • Allow specific folders