meghakhan.com
robots.txt

Robots Exclusion Standard data for meghakhan.com

Resource Scan

Scan Details

Site Domain meghakhan.com
Base Domain meghakhan.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2025-11-27T09:56:47+00:00
Next Scan 2025-12-27T09:56:47+00:00

Last Successful Scan

Scanned2025-10-05T10:33:47+00:00
URL https://meghakhan.com/robots.txt
Redirect https://www.meghakhan.com/robots.txt
Redirect Domain www.meghakhan.com
Redirect Base meghakhan.com
Domain IPs 145.223.77.29, 2a02:4780:2b:1818:0:3b76:9b69:d
Redirect IPs 145.223.77.29, 2a02:4780:2b:1818:0:3b76:9b69:d
Response IP 145.223.77.29
Found Yes
Hash 48fd860b4113c670d8d85905ab9e3c2837d645580edbe654dca4b034a41be94d
SimHash 5dcddc060e93

Groups

*

Rule Path
Disallow

httrack

Rule Path
Disallow /

netcaptor

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

spiderku/0.9

Rule Path
Disallow /

steeler

Rule Path
Disallow /

webcopier v3.3

Rule Path
Disallow /

webcopier v3.2a

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webcrawler

Rule Path
Disallow /

web downloader/4.9

Rule Path
Disallow /

web downloader/5.8

Rule Path
Disallow /

webgather 3.0

Rule Path
Disallow /

webstripper/2.56

Rule Path
Disallow /

webzip/3.65

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

zao

Rule Path
Disallow /

zeus 2.6

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/

*

Rule Path
Disallow /wp-admin/

Other Records

Field Value
sitemap https://www.meghakhan.com/sitemap.xml