alphahistory.com
robots.txt

Robots Exclusion Standard data for alphahistory.com

Resource Scan

Scan Details

Site Domain alphahistory.com
Base Domain alphahistory.com
Scan Status Ok
Last Scan2024-11-16T12:37:09+00:00
Next Scan 2024-11-23T12:37:09+00:00

Last Scan

Scanned2024-11-16T12:37:09+00:00
URL https://alphahistory.com/robots.txt
Domain IPs 104.26.4.107, 104.26.5.107, 172.67.74.33, 2606:4700:20::681a:46b, 2606:4700:20::681a:56b, 2606:4700:20::ac43:4a21
Response IP 104.26.5.107
Found Yes
Hash 407fec4a28c9ee14d12f479c47fe5bc0066c25e53edb3166dbc2eae3426852e0
SimHash f09a490b80a6

Groups

ia_archiver

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /