newsfilter.io
robots.txt

Robots Exclusion Standard data for newsfilter.io

Resource Scan

Scan Details

Site Domain newsfilter.io
Base Domain newsfilter.io
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-04-02T19:54:10+00:00
Next Scan 2024-07-01T19:54:10+00:00

Last Successful Scan

Scanned 2022-12-08T14:21:37+00:00
URL https://newsfilter.io/robots.txt
Domain IPs 13.33.88.120, 13.33.88.20, 13.33.88.30, 13.33.88.75
Response IP 13.33.88.30
Found Yes
Hash af1cf05dc14266c14b3132976d79feab8ba3137955a3bff2d4222f2215af74fd
SimHash e8108a02e7f3

Groups

*

Rule Path
Allow /
Disallow /legal/terms
Disallow /legal/privacy
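
As a rough illustration of how the three rules in the `*` group might be evaluated, the sketch below applies the longest-match precedence described in RFC 9309 (the most specific matching path wins, and a tie goes to Allow). The names `RULES` and `is_allowed` are hypothetical and not part of the scan data, and the sketch assumes plain prefix matching with no `*` or `$` wildcards, which is enough for the paths listed above.

```python
# Rules copied from the "*" group in the last successful scan.
RULES = [
    ("allow", "/"),
    ("disallow", "/legal/terms"),
    ("disallow", "/legal/privacy"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: plain prefix matching only (no wildcard support).
    The longest matching rule wins; on equal length, allow wins.
    A path that matches no rule is treated as allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/", "/articles", "/legal", "/legal/terms", "/legal/privacy"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Under this reading, `Allow /` opens the whole site and the two longer `Disallow` paths carve out the legal pages. Parsers that apply the first matching rule in file order rather than the longest match could reach a different result for those two paths.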

Other Records

Field Value
sitemap https://newsfilter.io/sitemap.xml