mitch.science
robots.txt
Robots Exclusion Standard data for mitch.science
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | mitch.science |
Base Domain | mitch.science |
Scan Status | Ok |
Last Scan | 2025-10-07T23:56:44+00:00 |
Next Scan | 2025-11-06T23:56:44+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-10-07T23:56:44+00:00 |
URL | https://mitch.science/robots.txt |
Domain IPs | 104.21.87.141, 172.67.169.225, 2606:4700:3031::ac43:a9e1, 2606:4700:3036::6815:578d |
Response IP | 104.21.87.141 |
Found | Yes |
Hash | 7a6f3e23f00519ca269fe015beaf39905d77c51a8a9887e14635688a89d49ad8 |
SimHash | 44354952cd94 |
Groups
*
Rule | Path |
---|---|
Allow | / |
*
Rule | Path |
---|---|
Disallow | /.github/ |
Disallow | /.phan/ |
Disallow | /assets/ |
Disallow | /backup/ |
Disallow | /bin/ |
Disallow | /cache/ |
Disallow | /logs/ |
Disallow | /system/ |
Disallow | /tests/ |
Disallow | /tmp/ |
Disallow | /user/ |
Disallow | /vendor/ |
Disallow | /webserver-configs/ |
Allow | /user/pages/ |
Allow | /user/themes/ |
Allow | /user/images/ |
Allow | / |
Allow | *.css$ |
Allow | *.js$ |
Allow | /system/*.js$ |
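
The rules above mix broad `Disallow` entries with narrower `Allow` entries, so the outcome for a given path depends on which rule takes precedence. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and `Allow` wins a tie) and assuming both `*` groups are merged into one. The function names and the sample paths are hypothetical and are not taken from the scan. Individual crawlers may resolve these rules differently.

```python
import re

# Rules copied from the two "*" groups above, merged into one list.
# Both groups contain "Allow /", so it is listed once.
RULES = [
    ("disallow", "/.github/"),
    ("disallow", "/.phan/"),
    ("disallow", "/assets/"),
    ("disallow", "/backup/"),
    ("disallow", "/bin/"),
    ("disallow", "/cache/"),
    ("disallow", "/logs/"),
    ("disallow", "/system/"),
    ("disallow", "/tests/"),
    ("disallow", "/tmp/"),
    ("disallow", "/user/"),
    ("disallow", "/vendor/"),
    ("disallow", "/webserver-configs/"),
    ("allow", "/user/pages/"),
    ("allow", "/user/themes/"),
    ("allow", "/user/images/"),
    ("allow", "/"),
    ("allow", "*.css$"),
    ("allow", "*.js$"),
    ("allow", "/system/*.js$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path):
    """Return True if `path` may be crawled under longest-match precedence."""
    best = None  # (pattern length, is_allow)
    for kind, pattern in RULES:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow (True) beats disallow.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise the rules.
    for sample in ["/", "/user/pages/about", "/user/config/site.yaml",
                   "/system/assets/jquery.js", "/assets/site.css"]:
        print(sample, "->", "allowed" if is_allowed(sample) else "disallowed")
```

Under this interpretation, `/system/*.js$` (13 characters) outranks `/system/` (8 characters), so scripts under `/system/` stay crawlable. By contrast, `*.css$` (6 characters) is shorter than `/assets/` (8 characters), so a stylesheet under `/assets/` would still be disallowed despite the `*.css$` entry.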
Warnings
- `content-signal` is not a known field.
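
A warning like the one above typically arises when a parser meets a field name outside the set it recognizes. The following is a rough sketch of such a check, not the scanner's actual logic. The `KNOWN_FIELDS` set, the function name, and the sample file contents (including the `Content-Signal` value) are all assumptions made for illustration.

```python
# Assumed set of recognized field names; the scanner's real list is not shown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay", "host"}


def find_unknown_fields(robots_txt):
    """Return one warning string per distinct unrecognized field name."""
    warnings = []
    seen = set()
    for raw_line in robots_txt.splitlines():
        # Drop comments and surrounding whitespace.
        line = raw_line.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # Field names are compared case-insensitively.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.add(field)
            warnings.append(f"`{field}` is not a known field.")
    return warnings


# Hypothetical file contents; the real robots.txt body is not part of this report.
SAMPLE = """\
User-agent: *
Content-Signal: search=yes
Allow: /
"""

if __name__ == "__main__":
    for warning in find_unknown_fields(SAMPLE):
        print("-", warning)
```

An unrecognized field does not invalidate the rest of the file: parsers following RFC 9309 skip lines they do not understand and keep processing the remaining rules.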