merlinsbricks.com
robots.txt

Robots Exclusion Standard data for merlinsbricks.com

Resource Scan

Scan Details

Site Domain merlinsbricks.com
Base Domain merlinsbricks.com
Scan Status Ok
Last Scan 2025-05-18T02:52:38+00:00
Next Scan 2025-06-17T02:52:38+00:00

Last Scan

Scanned 2025-05-18T02:52:38+00:00
URL https://merlinsbricks.com/robots.txt
Redirect https://www.merlinsbricks.com/robots.txt
Redirect Domain www.merlinsbricks.com
Redirect Base merlinsbricks.com
Domain IPs 2a01:238:20a:202:1151::, 81.169.145.151
Redirect IPs 139.99.62.128, 2402:1f00:8001:376::
Response IP 139.99.62.128
Found Yes
Hash 34e08ead80934a6db02325eab245202e6825a1a4c46d92291f7cb52055866b08
SimHash 701d595384e0
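
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not say which algorithm the scanner uses, so the following is only a minimal sketch, assuming SHA-256 over the raw response body, of how a content fingerprint like this could be used to detect changes between scans. The function names content_fingerprint and has_changed are hypothetical, and the SimHash value is not covered here.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 is used; the scan report does not name the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Report whether the body differs from the previously stored digest."""
    return content_fingerprint(body) != previous_hash


# Illustrative bodies only, not the real file contents.
old_body = b"User-agent: *\nDisallow: /my/\n"
new_body = b"User-agent: *\nDisallow: /my/\nDisallow: /short/\n"

stored = content_fingerprint(old_body)
print(len(stored))                    # digest length in hex characters
print(has_changed(stored, old_body))  # same bytes, so no change
print(has_changed(stored, new_body))  # one added rule changes the digest
```

An exact digest like this changes completely on any single-byte edit, which is presumably why the report also lists a separate similarity-oriented SimHash.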

Groups

*

Rule Path
Disallow /pinterest/
Disallow /short/
Disallow /profil/
Disallow /profile/
Disallow /my/

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

quora-bot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.merlinsbricks.com/sitemap.xml
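
To illustrate how the groups listed in this report would apply to a crawler, here is a minimal sketch using Python's standard urllib.robotparser module. The ROBOTS_TXT text is a reconstructed subset of the rules shown above, not the verbatim file, and the helper name allowed and the agent name ExampleBot are hypothetical. The result reflects how this particular parser matches agents and paths; other crawlers may interpret the file slightly differently.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed subset of the groups in this report (not the verbatim file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /pinterest/
Disallow: /short/
Disallow: /profil/
Disallow: /profile/
Disallow: /my/

User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /

Sitemap: https://www.merlinsbricks.com/sitemap.xml
"""

BASE = "https://www.merlinsbricks.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if this parser lets `agent` fetch `path` on the site."""
    return parser.can_fetch(agent, BASE + path)


# A crawler with no dedicated group falls back to the "*" group.
print(allowed("ExampleBot", "/"))            # not covered by any Disallow
print(allowed("ExampleBot", "/profile/42"))  # covered by Disallow /profile/

# Crawlers with their own group are blocked from the whole site.
print(allowed("GPTBot/1.0", "/"))
print(allowed("ClaudeBot", "/sets/"))

# Sitemap records are exposed separately (Python 3.8+).
print(parser.site_maps())
```

Under this parser, a named group replaces the "*" group for the matching crawler rather than adding to it, so the agents listed with Disallow / are excluded from every path.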