compareraja.in
robots.txt

Robots Exclusion Standard data for compareraja.in

Resource Scan

Scan Details

Site Domain compareraja.in
Base Domain compareraja.in
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-09-08T04:20:08+00:00
Next Scan 2024-12-07T04:20:08+00:00

Last Successful Scan

Scanned2023-11-13T04:40:20+00:00
URL https://compareraja.in/robots.txt
Redirect https://www.compareraja.in/robots.txt
Redirect Domain www.compareraja.in
Redirect Base compareraja.in
Domain IPs 35.154.216.61
Redirect IPs 35.154.216.61
Response IP 35.154.216.61
Found Yes
Hash b078b5e4d5e32eff1565117c0a6be3d728b9cf27a74273134bfdebf7c1476488
SimHash 4608ca70ce73

Groups

*

Rule Path
Allow /WebResource.axd?*
Allow /ScriptResource.axd?*
Allow /blog/*?*
Allow /styles/*?*
Allow /scripts/*?*

googlebot-image

Rule Path
Disallow
Disallow /grabber/
Disallow /*searchparam%3D

etaospider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

exabot

Rule Path
Disallow *?&minPrice=*
Disallow /search?*
Disallow *?color=*
Disallow *?colourvariant=*
Disallow *?categoryId=*

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.compareraja.in/sitemap-index.xml