semacraft.com
robots.txt
Robots Exclusion Standard data for semacraft.com
Resource Scan
Scan Details
| Site Domain | semacraft.com |
| Base Domain | semacraft.com |
| Scan Status | Ok |
| Last Scan | 2025-12-12T03:06:05+00:00 |
| Next Scan | 2026-01-11T03:06:05+00:00 |
Last Scan
| Scanned | 2025-12-12T03:06:05+00:00 |
| URL | https://semacraft.com/robots.txt |
| Domain IPs | 104.21.19.211, 172.67.190.53, 2606:4700:3032::6815:13d3, 2606:4700:3033::ac43:be35 |
| Response IP | 172.67.190.53 |
| Found | Yes |
| Hash | f0a12b26c4b66e5477bcfca8a8f5c79483b137658d713413489b5bd02cb20d6e |
| SimHash | e23d1952c5d4 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /administrator/ |
| Disallow | /cache/ |
| Disallow | /cli/ |
| Disallow | /components/ |
| Disallow | /images/ |
| Disallow | /includes/ |
| Disallow | /installation/ |
| Disallow | /language/ |
| Disallow | /libraries/ |
| Disallow | /logs/ |
| Disallow | /media/ |
| Disallow | /modules/ |
| Disallow | /plugins/ |
| Disallow | /templates/ |
| Disallow | /tmp/ |
Warnings
- `content-signal` is not a known field.
Comments