newsbox-inc.jp
robots.txt

Robots Exclusion Standard data for newsbox-inc.jp

Resource Scan

Scan Details

Site Domain newsbox-inc.jp
Base Domain newsbox-inc.jp
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-11-04T08:50:23+00:00
Next Scan 2025-01-03T08:50:23+00:00

Last Successful Scan

Scanned2024-08-14T07:14:15+00:00
URL https://newsbox-inc.jp/robots.txt
Domain IPs 104.21.61.178, 172.67.212.154, 2606:4700:3033::6815:3db2, 2606:4700:3037::ac43:d49a
Response IP 172.67.212.154
Found Yes
Hash e4b73af5c29aefa5bef36d7f6133e73c63c458803fdfd9e4a82aa9d4af31f586
SimHash 0a1c9d60ae9b

Groups

baiduspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://newsbox-inc.jp/sitemap.xml