buildinghistory.org
robots.txt

Robots Exclusion Standard data for buildinghistory.org

Resource Scan

Scan Details

Site Domain buildinghistory.org
Base Domain buildinghistory.org
Scan Status Ok
Last Scan2025-06-08T16:34:18+00:00
Next Scan 2025-07-08T16:34:18+00:00

Last Scan

Scanned2025-06-08T16:34:18+00:00
URL https://buildinghistory.org/robots.txt
Domain IPs 185.194.90.26
Response IP 185.194.90.26
Found Yes
Hash 942d08350aeec1d3bea68d683d19daa37075c7f7fadb5791d11b2d3c8b20d6cb
SimHash e02607f823a3

Groups

*

Rule Path
Disallow /rss.xml
Disallow /404.htm
Disallow /sitesearch.htm
Disallow /css/
Disallow /images/
Disallow /includes/
Disallow /archives/images/
Disallow /articles/images/
Disallow /bath/images/
Disallow /bath/styles/
Disallow /books/images/
Disallow /bristol/images/
Disallow /bristol/styles/
Disallow /buildings/images/
Disallow /church/images/
Disallow /distantpast/images/
Disallow /jean/images/
Disallow /jean/family/
Disallow /mickaston/Images/
Disallow /mickaston/styles/
Disallow /primary/images/
Disallow /primary/defoe/images/
Disallow /primary/inns/images/
Disallow /primary/magalotti/images/
Disallow /style/images/

aipbot/1.0

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

iltrovatore-setaccio

Rule Path
Disallow /

iltrovatore-setaccio/1.2

Rule Path
Disallow /

nextgensearchbot

Rule Path
Disallow /

szukacz/1.5

Rule Path
Disallow /

weatherbot

Rule Path
Disallow /

yottashopping_bot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /*.jpg$

googlebot

Rule Path
Disallow /*.gif$

googlebot-image

Rule Path
Disallow /

obot

Rule Path
Disallow /