keepoffline.com
robots.txt

Robots Exclusion Standard data for keepoffline.com

Resource Scan

Scan Details

Site Domain keepoffline.com
Base Domain keepoffline.com
Scan Status Ok
Last Scan2025-09-17T03:01:35+00:00
Next Scan 2025-09-24T03:01:35+00:00

Last Scan

Scanned2025-09-17T03:01:35+00:00
URL https://keepoffline.com/robots.txt
Redirect https://www.keepoffline.com/robots.txt
Redirect Domain www.keepoffline.com
Redirect Base keepoffline.com
Domain IPs 207.244.227.223
Redirect IPs 207.244.227.223
Response IP 207.244.227.223
Found Yes
Hash afccbbb006347218be83c537ed9ba3c8272b703f3da2c5ec95a05d19d1069634
SimHash 7d5158044ed3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow

httrack

Rule Path
Disallow /
Disallow /cgi-bin/

netcaptor

Rule Path
Disallow /
Disallow /cgi-bin/

offline explorer

Rule Path
Disallow /
Disallow /cgi-bin/

webcopier v3.3

Rule Path
Disallow /
Disallow /cgi-bin/

webcopier v3.2a

Rule Path
Disallow /
Disallow /cgi-bin/

webcopier

Rule Path
Disallow /
Disallow /cgi-bin/

webgather 3.0

Rule Path
Disallow /
Disallow /cgi-bin/

webzip

Rule Path
Disallow /
Disallow /cgi-bin/

wget

Rule Path
Disallow /
Disallow /cgi-bin/

zao

Rule Path
Disallow /
Disallow /cgi-bin/

zeus 2.6

Rule Path
Disallow /
Disallow /cgi-bin/

Other Records

Field Value
sitemap https://www.keepoffline.com/sitemap.xml
sitemap https://www.keepoffline.com/service-sitemap.xml
sitemap https://www.keepoffline.com/blog-list-sitemap.xml