104filmy.pl
robots.txt

Robots Exclusion Standard data for 104filmy.pl

Resource Scan

Scan Details

Site Domain 104filmy.pl
Base Domain 104filmy.pl
Scan Status Ok
Last Scan2024-11-12T12:53:22+00:00
Next Scan 2024-11-19T12:53:22+00:00

Last Scan

Scanned2024-11-12T12:53:22+00:00
URL https://104filmy.pl/robots.txt
Redirect https://bestflix.pl/robots.txt
Redirect Domain bestflix.pl
Redirect Base bestflix.pl
Domain IPs 104.21.26.81, 172.67.135.164, 2606:4700:3034::6815:1a51, 2606:4700:3034::ac43:87a4
Redirect IPs 104.21.94.131, 172.67.136.79, 2606:4700:3034::6815:5e83, 2606:4700:3034::ac43:884f
Response IP 104.21.94.131
Found Yes
Hash ecff12de842315558e3daeca97061be42e1148cab87e669d5b8c290b38afb05b
SimHash 709cd142c819

Groups

*

Rule Path
Allow /
Disallow /search/
Disallow /external/*
Disallow /blog
Disallow /go/*

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

mj12bot
amazonbot
blexbot
gptbot
ahrefssiteaudit
semrushbot
siteauditbot
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-coub
splitsignalbot
rogerbot
exabot
dotbot
gigabot
semrushbot/7~bl
dataforseobot
clark-crawler2
petalbot

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 240