captainphoto.info
robots.txt

Robots Exclusion Standard data for captainphoto.info

Resource Scan

Scan Details

Site Domain captainphoto.info
Base Domain captainphoto.info
Scan Status Ok
Last Scan2024-06-29T03:39:59+00:00
Next Scan 2024-07-13T03:39:59+00:00

Last Scan

Scanned2024-06-29T03:39:59+00:00
URL https://captainphoto.info/robots.txt
Domain IPs 104.21.19.32, 172.67.184.237, 2606:4700:3033::ac43:b8ed, 2606:4700:3034::6815:1320
Response IP 172.67.184.237
Found Yes
Hash b025a2912eab8698e68949d5470fab3cf999da206524af515cdeae392edf4d89
SimHash 50167970c48a

Groups

ahrefsbot
blexbot
baiduspider
bytedance
bytespider
ccbot
chatgpt-user
claude-web
claudebot
dataforseobot
diffbot
domainstatsbot
gptbot
httrack
httrack 3.0
mj12bot
mail.ru_bot
offline explorer
perplexitybot
petalbot
seokicks
scrapy
screaming frog seo spider
semrushbot
sogou web spider
wellknownbot
xenu
yandexbot
youbot
zoominfobot
anthropic-ai
archive.org_bot
cohere-ai
dotbot
ia_archiver
rogerbot
barkrowler
brightedge crawler
cocolyzebot
hypestat
linkdexbot
online-webceo-bot
serpstatbot
sitecheckerbotcrawler
seolizer
seobilitybot
senutobot

Rule Path
Disallow /

Warnings

  • 1 invalid line.