halalgaijin.com
robots.txt

Robots Exclusion Standard data for halalgaijin.com

Resource Scan

Scan Details

Site Domain halalgaijin.com
Base Domain halalgaijin.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-10-16T20:23:59+00:00
Next Scan 2026-01-14T20:23:59+00:00
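The gap between the two timestamps above can be checked with a short sketch. It only parses the Last Scan and Next Scan values as listed; the variable names are illustrative, and reading the result as a fixed rescan schedule is an assumption, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details rows above (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2025-10-16T20:23:59+00:00")
next_scan = datetime.fromisoformat("2026-01-14T20:23:59+00:00")

# Difference between the scheduled next scan and the last (failed) scan.
interval = next_scan - last_scan
print(interval.days)  # 90
```

The two values are exactly 90 days apart, which is consistent with (but does not confirm) a 90-day rescan interval.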

Last Successful Scan

Scanned 2024-11-28T07:18:40+00:00
URL https://halalgaijin.com/robots.txt
Domain IPs 104.21.39.96, 172.67.144.9, 2606:4700:3032::ac43:9009, 2606:4700:3037::6815:2760
Response IP 172.67.144.9
Found Yes
Hash f8fe0cdece974b16a964ed7d83057fc09f3d06dcef102ec3396b0f51759f527b
SimHash f0145505af53

Groups

*

Rule Path
Disallow /ghost/
Disallow /p/
Disallow /email/
Disallow /r/
Disallow /webmentions/receive/

Other Records

Field Value
sitemap https://halalgaijin.com/sitemap.xml
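As a rough illustration of how the records above apply to a crawler, the sketch below rebuilds a robots.txt body from the listed group and sitemap record and checks a few paths with Python's standard `urllib.robotparser`. The reconstructed text is an assumption (the original file's exact bytes, ordering, and comments are not shown in this report), and the `ExampleBot` user agent and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the Groups and Other Records
# sections; the real file may differ in layout or contain comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ghost/
Disallow: /p/
Disallow: /email/
Disallow: /r/
Disallow: /webmentions/receive/

Sitemap: https://halalgaijin.com/sitemap.xml
"""

BASE = "https://halalgaijin.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical sample paths: two covered by Disallow rules, two not.
for path in ("/", "/ghost/", "/p/example/", "/about/"):
    allowed = parser.can_fetch("ExampleBot", BASE + path)
    print(path, "allowed" if allowed else "disallowed")

# Sitemap records are exposed separately (Python 3.8+).
print(parser.site_maps())
```

Because the only group is `*`, any user agent string gets the same answer here; the rules are prefix matches, so `/p/example/` is blocked along with `/p/` itself.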