el.drit.ch
robots.txt

Robots Exclusion Standard data for el.drit.ch

Resource Scan

Scan Details

Site Domain el.drit.ch
Base Domain drit.ch
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't establish SSL connection.
Last Scan 2024-09-21T08:53:39+00:00
Next Scan 2024-12-20T08:53:39+00:00

Last Successful Scan

Scanned 2024-05-31T04:55:06+00:00
URL https://el.drit.ch/robots.txt
Domain IPs 139.162.185.90
Response IP 139.162.185.90
Found Yes
Hash f9a49f2d45ecbedd431e7bc26806826b70590fab0231a3cd15eb923d8533c754
SimHash 3a146d0148d4

Groups

*

Rule Path
Disallow /*/pancake.html?something=somethingelse$

Comments

  • this means any normal robot program will follow the disallow command below
  • This doesn't work for robots that don't recognise the wildcard * in a disallow line... however I'm pretty sure Google and Bing are ok with this. The $ is there to mark the end of the url. If you wanted to block a folder it'd look like /pancake_folder/
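
The wildcard behaviour described in the comments above can be sketched in a few lines of Python. This is only an illustration, assuming the common interpretation in which `*` matches any run of characters and a trailing `$` anchors the end of the URL. The helper names `rule_to_regex` and `is_disallowed` and the sample paths are hypothetical and do not come from the scanned site; real crawlers may differ in details such as percent-encoding and rule precedence.

```python
import re

# The Disallow path listed in the group above.
RULE = "/*/pancake.html?something=somethingelse$"


def rule_to_regex(path_pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters, and a
    trailing '$' means the URL must end there. Without '$' the pattern
    is treated as a prefix match.
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape every literal piece, then rejoin the pieces with '.*'.
    body = ".*".join(re.escape(piece) for piece in path_pattern.split("*"))
    # \Z anchors at the true end of the string.
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(url_path: str, path_pattern: str) -> bool:
    """Return True if url_path (path plus query string) matches the pattern."""
    return rule_to_regex(path_pattern).match(url_path) is not None


if __name__ == "__main__":
    samples = [
        "/breakfast/pancake.html?something=somethingelse",
        "/breakfast/pancake.html?something=somethingelse&x=1",
        "/pancake.html?something=somethingelse",
    ]
    for path in samples:
        print(path, is_disallowed(path, RULE))
```

Under these assumptions only the first sample path is blocked: the second continues past the `$` anchor, and the third has no directory segment for `/*/` to match. A plain folder rule such as `/pancake_folder/` has no wildcard or anchor, so it acts as a prefix and blocks everything beneath that folder.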