hellopurbachal.com
robots.txt

Robots Exclusion Standard data for hellopurbachal.com

Resource Scan

Scan Details

Site Domain hellopurbachal.com
Base Domain hellopurbachal.com
Scan Status Ok
Last Scan2026-02-07T15:05:25+00:00
Next Scan 2026-02-14T15:05:25+00:00

Last Scan

Scanned2026-02-07T15:05:25+00:00
URL https://hellopurbachal.com/robots.txt
Domain IPs 2a02:4780:15:1bda:a4d4:165b:20df:8b96, 2a02:4780:39:265f:6cb4:26fe:33f1:376e, 84.32.84.26, 84.32.84.78
Response IP 77.37.115.127
Found Yes
Hash 827d0461d3b29bfc8e0a9656c00a3925b38061b695ec8db4b157c6607b3243d2
SimHash 6220ca3aafbe

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /cgi-bin/
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /tag/
Disallow /author/
Disallow /category/

Other Records

Field Value
sitemap https://www.hellopurbachal.com/sitemap_index.xml

Comments

  • Block unnecessary and duplicate content paths
  • Block archive pages to avoid thin or duplicate content indexing
  • Sitemap location