404media.co
robots.txt
Robots Exclusion Standard data for 404media.co
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | 404media.co |
Base Domain | 404media.co |
Scan Status | Ok |
Last Scan | 2024-11-14T07:12:29+00:00 |
Next Scan | 2024-11-21T07:12:29+00:00 |
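The two timestamps above are ISO 8601 values with an explicit UTC offset, so the gap between them can be checked directly. The following is a minimal sketch; the variable names are illustrative, and treating the gap as a fixed rescan interval is an assumption drawn only from these two values, not something the report states.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-11-14T07:12:29+00:00")
next_scan = datetime.fromisoformat("2024-11-21T07:12:29+00:00")

# Gap between the recorded scan and the scheduled one.
interval = next_scan - last_scan
print(interval.days)
```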
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-14T07:12:29+00:00 |
URL | https://404media.co/robots.txt |
Redirect | https://www.404media.co/robots.txt |
Redirect Domain | www.404media.co |
Redirect Base | 404media.co |
Domain IPs | 178.128.137.126 |
Redirect IPs | 151.101.131.7, 151.101.195.7, 151.101.3.7, 151.101.67.7, 2a04:4e42:200::775, 2a04:4e42:400::775, 2a04:4e42:600::775, 2a04:4e42::775 |
Response IP | 199.232.47.7 |
Found | Yes |
Hash | c29c3aa47940a5afc59754059c01bc0d35f7b8bba122349ac14e7347ad01c300 |
SimHash | e0145da4e513 |
Groups
*
Rule | Path |
---|---|
Disallow | /ghost/ |
Disallow | /email/ |
Disallow | /members/api/comments/counts/ |
Disallow | /r/ |
Disallow | /webmentions/receive/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.404media.co/sitemap.xml |
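The group and sitemap records above can be exercised with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt body from the listed rules and checks a few paths against it. The rebuilt text is a reconstruction and is probably not byte-identical to the scanned file, so it should not be expected to reproduce the reported hash. The user agent name `ExampleBot` and the sample paths other than the listed prefixes are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables (not the original bytes).
ROBOTS_TXT = """\
User-agent: *
Disallow: /ghost/
Disallow: /email/
Disallow: /members/api/comments/counts/
Disallow: /r/
Disallow: /webmentions/receive/

Sitemap: https://www.404media.co/sitemap.xml
"""

BASE = "https://www.404media.co"  # redirect target reported by the scan


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical crawler name; it falls under the "*" group.
for path in ("/", "/ghost/", "/r/abc", "/some-article/"):
    print(path, parser.can_fetch("ExampleBot", BASE + path))

# Sitemap URLs declared in the file (site_maps() needs Python 3.8+).
print(parser.site_maps())
```

Matching here is by simple path prefix, so `/r/abc` is blocked by `Disallow: /r/`, while a path that starts with none of the listed prefixes is allowed.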