pardonsnowden.org
robots.txt

Robots Exclusion Standard data for pardonsnowden.org

Resource Scan

Scan Details

Site Domain pardonsnowden.org
Base Domain pardonsnowden.org
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-10-28T18:43:55+00:00
Next Scan 2025-11-27T18:43:55+00:00

Last Successful Scan

Scanned 2025-09-28T18:49:32+00:00
URL https://pardonsnowden.org/robots.txt
Redirect https://bantuanstrsara.my/robots.txt
Redirect Domain bantuanstrsara.my
Redirect Base bantuanstrsara.my
Domain IPs 203.223.152.141
Redirect IPs 203.223.152.141
Response IP 203.223.152.141
Found Yes
Hash 19f922c75abe29cffca95da01ff473632b600efe2f69dadb8c1dd4be0110a1be
SimHash c3249c02e6d2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /?s=
Disallow /search/
Disallow /trackback/
Disallow /xmlrpc.php
Disallow */trackback/
Disallow */feed/
Disallow */comments/
Disallow /*/feed/$
Disallow /*/*/feed/$
Disallow /*/*/*/feed/$
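
The rules in the `*` group mix plain prefixes with `*` wildcards and `$` end anchors. The following Python sketch shows one way a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309, where the most specific matching pattern wins and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and used only for illustration; how the scanning service or any particular crawler actually matches these rules is not stated in this report.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("disallow", "/?s="),
    ("disallow", "/search/"),
    ("disallow", "/trackback/"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "*/trackback/"),
    ("disallow", "*/feed/"),
    ("disallow", "*/comments/"),
    ("disallow", "/*/feed/$"),
    ("disallow", "/*/*/feed/$"),
    ("disallow", "/*/*/*/feed/$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix rule.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, `/wp-admin/admin-ajax.php` is allowed because the Allow pattern is longer than `/wp-admin/`, while `/wp-admin/options.php`, `/blog/feed/`, and `/?s=test` are disallowed.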

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
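
The per-agent groups above can be checked with Python's standard `urllib.robotparser`. This is a minimal sketch: the `ROBOTS_TXT` text is a partial reconstruction from the parsed groups, not the original file, and it assumes the `crawl-delay 10` record belongs to the `semrushbot` group. The agent-name capitalization and the helper name `summarize` are also assumptions. The wildcard rules are left out because `urllib.robotparser` matches plain path prefixes only.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction (assumption): the original file is not shown in
# this report, only its parsed groups.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php
Disallow: /wp-login.php

User-agent: MJ12bot
Disallow: /

User-agent: AhrefsBot
Disallow: /

User-agent: SemrushBot
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def summarize(agent, url="https://bantuanstrsara.my/"):
    """Return (may_fetch, crawl_delay) for the given user agent and URL."""
    return parser.can_fetch(agent, url), parser.crawl_delay(agent)


for agent in ("MJ12bot", "AhrefsBot", "SemrushBot", "Googlebot"):
    print(agent, summarize(agent))
```

With this reconstruction, MJ12bot and AhrefsBot are refused the site root, SemrushBot may fetch it with a 10-second delay, and any other agent falls through to the `*` group.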

Other Records

Field Value
sitemap https://bantuanstrsara.my/sitemap.xml

Comments

  • Robots.txt for bantuanstrsara.my
  • Allow all search engines to crawl the site
  • Block bad bots
  • Sitemap location (update this after generating sitemap)