404media.co
robots.txt

Robots Exclusion Standard data for 404media.co

Resource Scan

Scan Details

Site Domain 404media.co
Base Domain 404media.co
Scan Status Ok
Last Scan 2024-11-14T07:12:29+00:00
Next Scan 2024-11-21T07:12:29+00:00

Last Scan

Scanned 2024-11-14T07:12:29+00:00
URL https://404media.co/robots.txt
Redirect https://www.404media.co/robots.txt
Redirect Domain www.404media.co
Redirect Base 404media.co
Domain IPs 178.128.137.126
Redirect IPs 151.101.131.7, 151.101.195.7, 151.101.3.7, 151.101.67.7, 2a04:4e42:200::775, 2a04:4e42:400::775, 2a04:4e42:600::775, 2a04:4e42::775
Response IP 199.232.47.7
Found Yes
Hash c29c3aa47940a5afc59754059c01bc0d35f7b8bba122349ac14e7347ad01c300
SimHash e0145da4e513
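
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not say which algorithm or which bytes were hashed, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The function name body_fingerprint is hypothetical, and the sketch does not attempt to reproduce the SimHash value.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw bytes of robots.txt.
    The report itself does not confirm the algorithm or the input.
    """
    return hashlib.sha256(body).hexdigest()


# A digest computed this way always has 64 hex characters,
# the same length as the Hash field in the report.
digest = body_fingerprint(b"User-agent: *\nDisallow: /ghost/\n")
print(len(digest))
```

Comparing a freshly computed digest with the stored one would be one way to detect that the file changed between scans, if the assumption about the algorithm holds.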

Groups

*

Rule Path
Disallow /ghost/
Disallow /email/
Disallow /members/api/comments/counts/
Disallow /r/
Disallow /webmentions/receive/

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.404media.co/sitemap.xml
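
The groups and records listed above can be checked against concrete URLs with Python's standard urllib.robotparser module. The robots.txt text below is a reconstruction from this report, not the file as served: the exact field spelling, ordering, and letter case are assumptions, and the agent name "ExampleBot" is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records sections of the report.
# Assumption: the served file may differ in case, spacing, or comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ghost/
Disallow: /email/
Disallow: /members/api/comments/counts/
Disallow: /r/
Disallow: /webmentions/receive/

User-agent: gptbot
Disallow: /

Sitemap: https://www.404media.co/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://www.404media.co"

# The gptbot group disallows the whole site.
print(parser.can_fetch("GPTBot", base + "/"))
# Any other agent falls back to the * group.
print(parser.can_fetch("ExampleBot", base + "/"))
print(parser.can_fetch("ExampleBot", base + "/ghost/"))
# Sitemap records are exposed separately (Python 3.8+).
print(parser.site_maps())
```

Under this reconstruction, gptbot is refused everywhere, while other agents are refused only under the five paths listed in the * group.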