Robots Exclusion Standard data for en.samuraijournalist.com

Resource Scan

Scan Details

Site Domain	en.samuraijournalist.com
Base Domain	samuraijournalist.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-06-07T09:38:03+00:00
Next Scan	2025-09-05T09:38:03+00:00

Last Successful Scan

Scanned	2024-01-22T09:16:25+00:00
URL	https://en.samuraijournalist.com/robots.txt
Domain IPs	104.26.2.183, 104.26.3.183, 172.67.73.245, 2606:4700:20::681a:2b7, 2606:4700:20::681a:3b7, 2606:4700:20::ac43:49f5
Response IP	104.26.2.183
Found	Yes
Hash	3168f75fe02d13cec088dd3ec8f8bef5822ef9196266a8ba1ba773d6225ffcbb
SimHash	41451960c713

Groups

adsbot-google

Rule	Path
Disallow	(empty)

grapeshot

Rule	Path
Disallow	(empty)

gptbot

Rule	Path
Disallow	/

*

Rule	Path
Allow	/

Other Records

Field	Value
sitemap	https://www.samuraijournalist.com/sitemap.xml
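
The groups and the sitemap record listed here can be checked with a small script. The following is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text in it is an assumed reconstruction from the parsed rules, since the scan record does not include the raw file, and "ExampleBot" is a hypothetical crawler name used only to exercise the wildcard group.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the robots.txt from the parsed groups in this
# record; the original file's exact text, casing, and ordering are unknown.
ROBOTS_TXT = """\
User-agent: adsbot-google
Disallow:

User-agent: grapeshot
Disallow:

User-agent: gptbot
Disallow: /

User-agent: *
Allow: /

Sitemap: https://www.samuraijournalist.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

PAGE = "https://en.samuraijournalist.com/"  # any path on the host would do

# "ExampleBot" is hypothetical; it matches no named group and falls to "*".
for agent in ("AdsBot-Google", "grapeshot", "GPTBot", "ExampleBot"):
    verdict = "allowed" if parser.can_fetch(agent, PAGE) else "blocked"
    print(f"{agent}: {verdict}")

# Sitemap records are exposed via site_maps() (Python 3.8 or later).
print(parser.site_maps())
```

Under these assumptions, an empty Disallow blocks nothing, so adsbot-google and grapeshot may fetch everything, gptbot is blocked from the whole site, and every other crawler is allowed through the wildcard group.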