colonmagi.com
robots.txt

Robots Exclusion Standard data for colonmagi.com

Resource Scan

Scan Details

Site Domain colonmagi.com
Base Domain colonmagi.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-09-17T01:48:16+00:00
Next Scan 2024-11-16T01:48:16+00:00

Last Successful Scan

Scanned 2024-07-20T01:38:54+00:00
URL https://colonmagi.com/robots.txt
Domain IPs 34.202.151.185, 52.200.79.233, 54.158.12.24, 54.165.104.227, 54.208.187.92
Response IP 54.208.187.92
Found Yes
Hash 31c22a6c23b55e0abcea3b92f74280fe8c64ee279436a3c217a85c7f04f038a7
SimHash ab86e482c371

Groups

mj12bot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

spbot

Rule Path
Disallow /

amazon-kendra

Product Comment
amazon-kendra Amazon Kendra Web Crawler
Rule Path Comment
Disallow / disallow access to any pages

*

Rule Path
Disallow /admin/
Disallow /cfformprotect/
Disallow /cftags/
Disallow /common/
Disallow /config/
Disallow /export/
Disallow /fonts/
Disallow /frameworks/
Disallow /img/
Disallow /includes/
Disallow /layouts/
Disallow /model/
Disallow /modules/
Disallow /services/
Disallow /src/
Disallow /udf/
Disallow /WEB-INF/
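
The groups above can be checked against a crawler name with Python's standard urllib.robotparser module. The sketch below is illustrative only: the ROBOTS_TXT text is a reconstruction from the listed rules (the original file's exact layout, ordering, and comments are assumptions), and the allowed() helper and the "ExampleBot" agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed above; the real file's exact
# formatting and comment lines are assumptions.
ROBOTS_TXT = """\
User-agent: mj12bot
Disallow: /

User-agent: bubing
Disallow: /

User-agent: spbot
Disallow: /

User-agent: amazon-kendra
Disallow: /

User-agent: *
Disallow: /admin/
Disallow: /cfformprotect/
Disallow: /cftags/
Disallow: /common/
Disallow: /config/
Disallow: /export/
Disallow: /fonts/
Disallow: /frameworks/
Disallow: /img/
Disallow: /includes/
Disallow: /layouts/
Disallow: /model/
Disallow: /modules/
Disallow: /services/
Disallow: /src/
Disallow: /udf/
Disallow: /WEB-INF/
"""

# Parse the rules from memory instead of fetching them over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on colonmagi.com?"""
    return parser.can_fetch(agent, "https://colonmagi.com" + path)


if __name__ == "__main__":
    for agent, path in [
        ("mj12bot", "/"),
        ("amazon-kendra", "/index.html"),
        ("ExampleBot", "/index.html"),
        ("ExampleBot", "/admin/login"),
    ]:
        print(agent, path, allowed(agent, path))
```

Under this reconstruction, the four named crawlers are blocked from the whole site, while any other agent falls through to the * group and is blocked only from the listed directories.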