crdj.in
robots.txt

Robots Exclusion Standard data for crdj.in

Resource Scan

Scan Details

Site Domain crdj.in
Base Domain crdj.in
Scan Status Ok
Last Scan2026-03-15T07:04:29+00:00
Next Scan 2026-04-14T07:04:29+00:00

Last Scan

Scanned2026-03-15T07:04:29+00:00
URL https://crdj.in/robots.txt
Domain IPs 2.57.91.16, 2a02:4780:84:d0b6:7b19:e397:72b:20f7, 2a02:4780:85:4382:3c08:b433:b433:c4f0, 88.222.222.27
Response IP 93.127.187.158
Found Yes
Hash 393a72458653052d69fac3a214b7bd7d98c9a18b19b2574866bf2e277e54ff9a
SimHash a15c5c7023c8

Groups

*

Rule Path
Disallow /administrator/
Disallow /cache/
Disallow /components/
Disallow /images/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /templates/
Disallow /tmp/
Disallow /xmlrpc/

Other Records

Field Value
sitemap http://cdn.attracta.com/sitemap/2270629.xml.gz

Comments

  • Begin Attracta SEO Tools Sitemap. Do not remove
  • End Attracta SEO Tools Sitemap. Do not remove