webtechnocraft.com
robots.txt
Robots Exclusion Standard data for webtechnocraft.com
Resource Scan
Scan Details
| Site Domain | webtechnocraft.com |
| Base Domain | webtechnocraft.com |
| Scan Status | Ok |
| Last Scan | 2025-10-06T11:43:42+00:00 |
| Next Scan | 2025-11-05T11:43:42+00:00 |
Last Scan
| Scanned | 2025-10-06T11:43:42+00:00 |
| URL | https://webtechnocraft.com/robots.txt |
| Domain IPs | 104.21.73.204, 172.67.192.36, 2606:4700:3030::ac43:c024, 2606:4700:3036::6815:49cc |
| Response IP | 172.67.192.36 |
| Found | Yes |
| Hash | cc7d8db57b7b348915150ea0988f7a39cae640f61c84534eb64d6285e7eea495 |
| SimHash | 4435c953cc94 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /cgi-bin/ |
Other Records
| Field | Value |
|---|---|
| sitemap | http://www.webtechnocraft.com/sitemap.xml |
Warnings
- `content-signal` is not a known field.
Comments