webdicio.com
robots.txt
Robots Exclusion Standard data for webdicio.com
Resource Scan
Scan Details
| Site Domain | webdicio.com |
| Base Domain | webdicio.com |
| Scan Status | Ok |
| Last Scan | 2026-03-26T23:26:32+00:00 |
| Next Scan | 2026-04-25T23:26:32+00:00 |
Last Scan
| Scanned | 2026-03-26T23:26:32+00:00 |
| URL | https://webdicio.com/robots.txt |
| Domain IPs | 104.21.59.63, 172.67.216.159, 2606:4700:3034::6815:3b3f, 2606:4700:3036::ac43:d89f |
| Response IP | 172.67.216.159 |
| Found | Yes |
| Hash | fa5ce64f9807cd60f0e0c877d718b28c7c5053e54e80737768b024f9e4352999 |
| SimHash | 4e34f970e777 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /search |
| Disallow | /admin |
| Disallow | /search?* |
| Disallow | /search?search= |
| Disallow | /*.pdf$ |
| Disallow | /? |
| Disallow | /*? |
| Disallow | /*?page= |
| Disallow | /cgi-bin* |
| Allow | / |
Other Records
| Field | Value |
|---|---|
| sitemap | https://webdicio.com/sitemap.xml |