cegra.press
robots.txt

Robots Exclusion Standard data for cegra.press

Resource Scan

Scan Details

Site Domain cegra.press
Base Domain cegra.press
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-10-05T11:59:37+00:00
Next Scan 2025-12-04T11:59:37+00:00

Last Successful Scan

Scanned 2025-08-07T00:19:12+00:00
URL http://cegra.press/robots.txt
Domain IPs 31.177.76.32, 31.177.80.32
Response IP 31.177.76.32
Found Yes
Hash 5309c78b495c1dd146b0c1310b028f41acf9880c0eb729a47c4cd4d8ba6a1d20
SimHash 810083625eb3

Groups

*

Rule Path
Allow /
Disallow (empty)

mj12bot

Rule Path
Disallow /

sputnikbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

archive.org-bot

Rule Path
Disallow /

hypercrawl

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

Warnings

  • `host` is not a known field.
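
To see how the groups listed above would be evaluated, here is a minimal Python sketch using the standard library's `urllib.robotparser`. It rebuilds an approximate robots.txt from the Groups listing; the exact layout of the original file is an assumption, and the unrecognized `host` line is left out because its value is not shown in the scan. The helper names `build_robots_txt` and `can_fetch`, and the agent name `examplebot`, are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Agents that the scan lists with "Disallow /".
BLOCKED_AGENTS = [
    "mj12bot", "sputnikbot", "ahrefsbot", "rogerbot",
    "dotbot", "archive.org-bot", "hypercrawl", "wbsearchbot",
]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the scanned groups.

    This is a reconstruction (an assumption), not the original file.
    """
    # Default group: "Allow: /" plus an empty "Disallow:", as in the scan.
    lines = ["User-agent: *", "Allow: /", "Disallow:", ""]
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    return "\n".join(lines)


def can_fetch(agent: str, url: str = "http://cegra.press/") -> bool:
    """Return True if the reconstructed rules let `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    # "examplebot" is a made-up agent that only matches the "*" group.
    for agent in ["examplebot", "mj12bot", "ahrefsbot"]:
        print(agent, can_fetch(agent))
```

Under these assumptions, an agent that matches only the `*` group is allowed everywhere, while each named crawler is denied the whole site. Note that `urllib.robotparser` matches agent names case-insensitively by substring, which may differ from how a given crawler interprets the same file.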