turflix.com
robots.txt

Robots Exclusion Standard data for turflix.com

Resource Scan

Scan Details

Site Domain turflix.com
Base Domain turflix.com
Scan Status Ok
Last Scan2024-10-29T17:07:09+00:00
Next Scan 2024-11-05T17:07:09+00:00

Last Scan

Scanned2024-10-29T17:07:09+00:00
URL https://turflix.com/robots.txt
Domain IPs 104.21.15.90, 172.67.205.169, 2606:4700:3033::ac43:cda9, 2606:4700:3037::6815:f5a
Response IP 172.67.205.169
Found Yes
Hash ecff12de842315558e3daeca97061be42e1148cab87e669d5b8c290b38afb05b
SimHash 709cd142c819

Groups

*

Rule Path
Allow /
Disallow /search/
Disallow /external/*
Disallow /blog
Disallow /go/*

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

mj12bot
amazonbot
blexbot
gptbot
ahrefssiteaudit
semrushbot
siteauditbot
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-coub
splitsignalbot
rogerbot
exabot
dotbot
gigabot
semrushbot/7~bl
dataforseobot
clark-crawler2
petalbot

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 240