geetmishra.com
robots.txt

Robots Exclusion Standard data for geetmishra.com

Resource Scan

Scan Details

Site Domain geetmishra.com
Base Domain geetmishra.com
Scan Status Ok
Last Scan2025-05-02T01:22:33+00:00
Next Scan 2025-06-01T01:22:33+00:00

Last Scan

Scanned2025-05-02T01:22:33+00:00
URL https://geetmishra.com/robots.txt
Domain IPs 2a02:4780:38:a68a:b61f:c49d:579a:ef21, 84.32.84.11
Response IP 93.127.196.64
Found Yes
Hash 8414f6c79afede1022030e6eb0d884121099bda1691e86894a3c4b199c8447fe
SimHash 0900d4a34613

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Allow /

slurp

Rule Path
Allow /

msnbot

Rule Path
Allow /

ia_archiver

Rule Path
Allow /

scrubby

Rule Path
Allow /

baiduspider

Rule Path
Allow /

httrack
netcaptor
offline explorer
spiderku/0.9
steeler
webcopier v3.3
webcopier v3.2a
webcopier
webcawler
web downloader/4.9
web downloader/5.8
webgather 3.0
webstripper/2.56
webzip/3.65
webzip
wget
zao
zeus 2.6

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.geetmishra.com/sitemap.xml