
Robots Exclusion Standard data for img16.imageshack.us

Resource Scan

Scan Details

Site Domain img16.imageshack.us
Base Domain imageshack.us
Scan Status Ok
Last Scan 2024-09-21T18:39:34+00:00
Next Scan 2024-10-21T18:39:34+00:00

Last Scan

Scanned 2024-09-21T18:39:34+00:00
URL https://img16.imageshack.us/robots.txt
Domain IPs 38.99.77.16, 38.99.77.17
Response IP 38.99.77.16
Found Yes
Hash da06824982835a872347871e3b1788cf0f4c594c593a07a1065c43e53eaa4e09
SimHash f0164901c1c4

Groups

ai2bot
ai2bot-dolma
amazonbot
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
facebookbot
friendlycrawler
gptbot
google-extended
googleother
googleother-image
googleother-video
icc-crawler
imagesiftbot
meta-externalagent
meta-externalfetcher
oai-searchbot
perplexitybot
petalbot
scrapy
timpibot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
facebookexternalhit
img2dataset
omgili
omgilibot

Rule Path
Disallow /
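
The groups and rule listed above can be checked with a short sketch. It assumes the listed user agents share a single `Disallow: /` rule, as the report indicates; the exact layout of the real file is not shown here, so the reconstructed text, the helper names `build_robots_txt` and `make_parser`, the shortened agent list, and the example URL are all hypothetical. Parsing uses Python's standard `urllib.robotparser`.

```python
from urllib.robotparser import RobotFileParser

# A few of the user agents from the Groups list (shortened for illustration).
BLOCKED_AGENTS = ["gptbot", "ccbot", "claudebot", "bytespider"]


def build_robots_txt(agents):
    """Build a robots.txt with one group: all agents share 'Disallow: /'.

    This layout is an assumption; the real file may repeat the rule per agent.
    """
    lines = [f"User-agent: {agent}" for agent in agents]
    lines.append("Disallow: /")
    return "\n".join(lines) + "\n"


def make_parser(robots_txt):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


if __name__ == "__main__":
    parser = make_parser(build_robots_txt(BLOCKED_AGENTS))
    url = "https://img16.imageshack.us/example.png"  # hypothetical path
    for agent in ["GPTBot", "ExampleBrowser/1.0"]:
        print(agent, "allowed" if parser.can_fetch(agent, url) else "blocked")
```

Under these assumptions, a listed agent such as `GPTBot` is refused every path, while an agent that matches no group is allowed, because the Groups list contains no `*` entry.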