indopakarts.com
robots.txt

Robots Exclusion Standard data for indopakarts.com

Resource Scan

Scan Details

Site Domain indopakarts.com
Base Domain indopakarts.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-07-09T10:39:23+00:00
Next Scan 2024-10-07T10:39:23+00:00

Last Successful Scan

Scanned2023-06-16T09:24:13+00:00
URL https://indopakarts.com/robots.txt
Domain IPs 13.250.129.152, 2406:da18:9d0:143e:8e74:1b1a:98b9:2813, 2406:da18:9d0:143f:29e7:ae24:cfea:e9bb, 54.151.156.30
Response IP 3.67.181.148
Found Yes
Hash 8ff455502660a315c2acb8ca7c76c19ecd8952f4c7877c7dcee37ef57835e3cf
SimHash b51f44400ca2

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/

*

Rule Path
Allow /
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /readme.html$

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

awariobot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://indopakarts.com/sitemap.xml
sitemap https://indopakarts.com/sitemap.rss

Comments

  • BEGIN Magic robots.txt
  • ---------------------------
  • General
  • Link analyzers
  • Downloaders
  • ---------------------------
  • END Magic robots.txt