brooklynrail.org
robots.txt

Robots Exclusion Standard data for brooklynrail.org

Resource Scan

Scan Details

Site Domain brooklynrail.org
Base Domain brooklynrail.org
Scan Status Ok
Last Scan 2024-05-19T15:18:35+00:00
Next Scan 2024-06-18T15:18:35+00:00

Last Scan

Scanned 2024-05-19T15:18:35+00:00
URL https://brooklynrail.org/robots.txt
Domain IPs 104.21.11.201, 172.67.167.66, 2606:4700:3031::ac43:a742, 2606:4700:3033::6815:bc9
Response IP 104.21.11.201
Found Yes
Hash da50dd00164f1762c8ff02394abdb1b8d6b099d8814c8bc789e8baec81841a28
SimHash 9369b1670e6f

Groups

rogerbot
exabot
mj12bot
dotbot
gigabot
ahrefsbot
blackwidow
semrushbot
semrushbot/1.1~bl
linkpadbot
ahrefsbot/5.1
chinaclaw
custo
disco
ecatch
eirgrabber
emailsiphon
emailwolf
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
interget
jetcar
larbin
leechftp
navroad
nearsite
netants
netspider
netzip
octopus
pagegrabber
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
voideye
webauto
webcopier
webfetch
webleacher
webreaper
websauger
webstripper
webwhacker
webzip
wget
widow
wwwoffle
zeus
gptbot
google-extended
ccbot
chatgpt-user
anthropic-ai
omgilibot
omgili
facebookbot

Rule Path
Disallow /

*

Rule Path
Disallow /support-beta
Disallow /unpublished
Disallow /t/*
Disallow /id/*

Other Records

Field Value
sitemap https://brooklynrail.org/sitemap.xml
sitemap https://brooklynrail.org/sitemap_contributors.xml
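
The groups, rules, and sitemap records above can be checked with a short script. The following is a minimal sketch using Python's standard `urllib.robotparser`; the robots.txt text is reconstructed from this report with only a subset of the blocked agents, so its exact layout is an assumption, and the `allowed` helper and the `ExampleBrowser` agent name are hypothetical. Note that `urllib.robotparser` does not interpret `*` wildcards inside paths, so the `/t/*` and `/id/*` rules are left out of the checks.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; only a subset of the blocked agents
# is listed, and the real file's exact layout is an assumption.
ROBOTS_TXT = """\
User-agent: gptbot
User-agent: ccbot
User-agent: anthropic-ai
User-agent: wget
Disallow: /

User-agent: *
Disallow: /support-beta
Disallow: /unpublished
Disallow: /t/*
Disallow: /id/*

Sitemap: https://brooklynrail.org/sitemap.xml
Sitemap: https://brooklynrail.org/sitemap_contributors.xml
"""

BASE = "https://brooklynrail.org"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on this site?"""
    return parser.can_fetch(agent, BASE + path)


# Agents named in the first group are blocked from the whole site.
gptbot_home = allowed("GPTBot/1.0", "/")

# Any other agent falls through to the "*" group.
other_home = allowed("ExampleBrowser/1.0", "/")
other_beta = allowed("ExampleBrowser/1.0", "/support-beta")

# Sitemap records are exposed separately (Python 3.8+).
sitemaps = parser.site_maps()

print(gptbot_home, other_home, other_beta)
print(sitemaps)
```

Agent matching in `urllib.robotparser` is case-insensitive and ignores the version suffix after `/`, which is why `GPTBot/1.0` matches the `gptbot` group.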

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file