cgian.com
robots.txt
Robots Exclusion Standard data for cgian.com
Resource Scan
Scan Details
| Site Domain | cgian.com |
| Base Domain | cgian.com |
| Scan Status | Ok |
| Last Scan | 2026-01-13T20:50:52+00:00 |
| Next Scan | 2026-01-20T20:50:52+00:00 |
Last Scan
| Scanned | 2026-01-13T20:50:52+00:00 |
| URL | https://cgian.com/robots.txt |
| Domain IPs | 35.214.213.30 |
| Response IP | 35.214.213.30 |
| Found | Yes |
| Hash | 04b276531cf1d84ab32ac8a605b381879d4778b99156911ede44df1b0288ba4c |
| SimHash | 690908008393 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp-admin/ |
| Allow | /wp-admin/admin-ajax.php |
| Disallow | */feed/ |
| Disallow | /admin/ |
| Disallow | /cgi-bin/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://cgian.com/sitemap_index.xml |