halalgaijin.com
robots.txt
Robots Exclusion Standard data for halalgaijin.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | halalgaijin.com |
Base Domain | halalgaijin.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2025-10-16T20:23:59+00:00 |
Next Scan | 2026-01-14T20:23:59+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-11-28T07:18:40+00:00 |
URL | https://halalgaijin.com/robots.txt |
Domain IPs | 104.21.39.96, 172.67.144.9, 2606:4700:3032::ac43:9009, 2606:4700:3037::6815:2760 |
Response IP | 172.67.144.9 |
Found | Yes |
Hash | f8fe0cdece974b16a964ed7d83057fc09f3d06dcef102ec3396b0f51759f527b |
SimHash | f0145505af53 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /ghost/ |
Disallow | /p/ |
Disallow | /email/ |
Disallow | /r/ |
Disallow | /webmentions/receive/ |
Other Records
Field | Value |
---|---|
sitemap | https://halalgaijin.com/sitemap.xml |
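
The records above can be checked against concrete paths. The following is a minimal sketch, assuming the robots.txt body can be reconstructed from the Groups and Other Records tables; the exact bytes of the scanned file are not shown in this report, so the reconstructed text will not necessarily match the recorded hash. The helper `is_allowed`, the agent name `ExampleBot`, and the sample paths are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the tables in this report (an assumption: the original
# file may differ in comments, ordering, or whitespace).
ROBOTS_TXT = """\
User-agent: *
Disallow: /ghost/
Disallow: /p/
Disallow: /email/
Disallow: /r/
Disallow: /webmentions/receive/

Sitemap: https://halalgaijin.com/sitemap.xml
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network fetch is needed.
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://halalgaijin.com" + path)


if __name__ == "__main__":
    # Hypothetical sample paths, chosen to exercise each rule.
    for path in ["/", "/ghost/", "/p/abc", "/email/", "/r/xyz",
                 "/webmentions/receive/", "/some-post/"]:
        print(path, "allowed" if is_allowed(path) else "disallowed")
    # site_maps() is available in Python 3.8 and later.
    print(parser.site_maps())
```

Because the only group is `*`, the same rules apply to every crawler that does not have a more specific group, which is why the agent name passed to `can_fetch` does not affect the result here.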