crdj.in
robots.txt
Robots Exclusion Standard data for crdj.in
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | crdj.in |
| Base Domain | crdj.in |
| Scan Status | Ok |
| Last Scan | 2026-03-15T07:04:29+00:00 |
| Next Scan | 2026-04-14T07:04:29+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-03-15T07:04:29+00:00 |
| URL | https://crdj.in/robots.txt |
| Domain IPs | 2.57.91.16, 2a02:4780:84:d0b6:7b19:e397:72b:20f7, 2a02:4780:85:4382:3c08:b433:b433:c4f0, 88.222.222.27 |
| Response IP | 93.127.187.158 |
| Found | Yes |
| Hash | 393a72458653052d69fac3a214b7bd7d98c9a18b19b2574866bf2e277e54ff9a |
| SimHash | a15c5c7023c8 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /administrator/ |
| Disallow | /cache/ |
| Disallow | /components/ |
| Disallow | /images/ |
| Disallow | /includes/ |
| Disallow | /installation/ |
| Disallow | /language/ |
| Disallow | /libraries/ |
| Disallow | /media/ |
| Disallow | /modules/ |
| Disallow | /plugins/ |
| Disallow | /templates/ |
| Disallow | /tmp/ |
| Disallow | /xmlrpc/ |
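
The rules in this group can be checked programmatically. The following is a minimal sketch, assuming the `*` group corresponds to a `User-agent: *` record and that the file body can be reconstructed from the table (the raw robots.txt text is not reproduced in this report). It uses Python's standard `urllib.robotparser`; the helper name `is_allowed` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: robots.txt body reconstructed from the rule table,
# not copied from the live file.
DISALLOWED = [
    "/administrator/", "/cache/", "/components/", "/images/",
    "/includes/", "/installation/", "/language/", "/libraries/",
    "/media/", "/modules/", "/plugins/", "/templates/",
    "/tmp/", "/xmlrpc/",
]
robots_lines = ["User-agent: *"] + [f"Disallow: {p}" for p in DISALLOWED]

parser = RobotFileParser()
parser.parse(robots_lines)


def is_allowed(path: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `path` on crdj.in under these rules."""
    return parser.can_fetch(agent, "https://crdj.in" + path)


for path in ["/", "/index.php", "/administrator/index.php", "/images/logo.png"]:
    print(path, is_allowed(path))
```

Matching here is by path prefix, so `/administrator/index.php` is blocked while `/administrator` without the trailing slash is not. The example paths are illustrative and do not imply that those URLs exist on the site.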
Other Records
| Field | Value |
|---|---|
| sitemap | http://cdn.attracta.com/sitemap/2270629.xml.gz |
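
The sitemap record can be read with the same standard-library parser. This is a minimal sketch, assuming the record appears in the file as a `Sitemap:` line; the surrounding lines are a shortened, reconstructed stand-in for the real file, and `site_maps()` requires Python 3.8 or later.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# Assumption: shortened, reconstructed robots.txt containing the sitemap record.
sitemap_lines = [
    "User-agent: *",
    "Disallow: /tmp/",
    "Sitemap: http://cdn.attracta.com/sitemap/2270629.xml.gz",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(sitemap_lines)

# site_maps() returns None when no Sitemap lines are present.
sitemaps = sitemap_parser.site_maps() or []
sitemap_hosts = [urlparse(url).hostname for url in sitemaps]

print(sitemaps)
print(sitemap_hosts)
```

The listed sitemap is served over plain `http` from `cdn.attracta.com` rather than from crdj.in itself, so a crawler that restricts sitemaps to the site's own host would need to account for that.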