linux.icu
robots.txt

Robots Exclusion Standard data for linux.icu

Resource Scan

Scan Details

Site Domain linux.icu
Base Domain linux.icu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-07-18T18:14:03+00:00
Next Scan 2025-10-16T18:14:03+00:00

Last Successful Scan

Scanned2023-03-09T09:31:40+00:00
URL https://linux.icu/robots.txt
Domain IPs 104.21.15.30, 172.67.161.71, 2606:4700:3030::ac43:a147, 2606:4700:3037::6815:f1e
Response IP 172.67.161.71
Found Yes
Hash c8de92b69383cf4869740ec1318f283b72c533d1ad4eac245c2bae176306d034
SimHash 38095c05ad11

Groups

*

Rule Path
Disallow /ghost/
Disallow /p/

Other Records

Field Value
sitemap http://linux.icu/sitemap.xml