/.well-known/

Log In Sign Up

samuraijournalist.com
robots.txt

Robots Exclusion Standard data for samuraijournalist.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	samuraijournalist.com
Base Domain	samuraijournalist.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-06-14T16:18:30+00:00
Next Scan	2024-09-12T16:18:30+00:00

Last Successful Scan

Scanned	2024-02-16T16:12:39+00:00
URL	https://samuraijournalist.com/robots.txt
Redirect	https://www.samuraijournalist.com/robots.txt
Redirect Domain	www.samuraijournalist.com
Redirect Base	samuraijournalist.com
Domain IPs	104.26.2.183, 104.26.3.183, 172.67.73.245, 2606:4700:20::681a:2b7, 2606:4700:20::681a:3b7, 2606:4700:20::ac43:49f5
Redirect IPs	151.101.1.55, 151.101.129.55, 151.101.193.55, 151.101.65.55, 2a04:4e42:200::311, 2a04:4e42:400::311, 2a04:4e42:600::311, 2a04:4e42::311
Response IP	199.232.45.55
Found	Yes
Hash	3168f75fe02d13cec088dd3ec8f8bef5822ef9196266a8ba1ba773d6225ffcbb
SimHash	41451960c713

Groups

adsbot-google

Rule

Path

Disallow

grapeshot

Rule

Path

Disallow

gptbot

Rule

Path

Disallow

/

*

Rule

Path

Allow

/

Back to top

Other Records

Field

Value

sitemap

https://www.samuraijournalist.com/sitemap.xml

Back to top