findpostoffice.in
robots.txt

Robots Exclusion Standard data for findpostoffice.in

Resource Scan

Scan Details

Site Domain findpostoffice.in
Base Domain findpostoffice.in
Scan Status Ok
Last Scan2025-11-11T09:16:32+00:00
Next Scan 2025-11-18T09:16:32+00:00

Last Scan

Scanned2025-11-11T09:16:32+00:00
URL https://findpostoffice.in/robots.txt
Domain IPs 104.21.56.214, 172.67.156.7, 2606:4700:3031::ac43:9c07, 2606:4700:3032::6815:38d6
Response IP 104.21.56.214
Found Yes
Hash a43c35beacc424622825079e45d20581de6baa5c9f0e18b46fe86db1b3090cd6
SimHash 6688c8d0c69f

Groups

*

Rule Path
Disallow

mj12bot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /
Disallow /*?*

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://findpostoffice/sitemaps/sitemap.xml

Comments

  • Allow all bots (default rule)
  • Block specific bots
  • Limit crawling frequency (well-behaved bots will follow this)
  • Disallow bots from crawling dynamically generated search result pages
  • Sitemap location