newsbreak.jp
robots.txt

Robots Exclusion Standard data for newsbreak.jp

Resource Scan

Scan Details

Site Domain newsbreak.jp
Base Domain newsbreak.jp
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-03-23T22:58:22+00:00
Next Scan 2024-06-21T22:58:22+00:00

Last Successful Scan

Scanned2022-10-13T19:54:07+00:00
URL https://newsbreak.jp/robots.txt
Redirect https://www.newsbreak.jp/robots.txt
Redirect Domain www.newsbreak.jp
Redirect Base newsbreak.jp
Response IP 52.196.182.204, 13.114.252.224, 13.114.82.82
Found Yes
Hash 96181ab4a08adee32066102ff1a7fce0e2ec6417c205884ab1b937253cb5b4ea
SimHash 35155f124d72

Groups

*

Rule Path
Disallow /_api/
Disallow /privacy
Disallow /terms
Disallow /redirect-external
Disallow /me
Disallow /following
Disallow /following/

Other Records

Field Value
sitemap https://www.newsbreak.com.br/sitemap.xml