assortedflotsam.com
robots.txt

Robots Exclusion Standard data for assortedflotsam.com

Resource Scan

Scan Details

Site Domain assortedflotsam.com
Base Domain assortedflotsam.com
Scan Status Ok
Last Scan2025-04-10T21:34:58+00:00
Next Scan 2025-04-11T21:34:58+00:00

Last Scan

Scanned2025-04-10T21:34:58+00:00
URL https://assortedflotsam.com/robots.txt
Domain IPs 144.126.131.135
Response IP 144.126.131.135
Found Yes
Hash bf2d335a32010ada61357270ebece4c4e7a203ea543487b19da41d34143bb73d
SimHash f2171b01d744

Groups

ccbot
chatgpt-user
gptbot
gptbot-user
chatgpt
bytespider
claudebot
imagesiftbot
omgili
diffbot
claude-web
perplexitybot
searchgpt
searchgpt-user
meta-externalfetcher
amazonbot
applebot
oai-searchbot
youbot
applebot-extended
facebookbot
google-extended
meta-externalagent
ai agent anthropic-ai
ai agent claude-web
ai2bot
ai2bot-dolma
friendlycrawler
googleother
googleother-image
googleother-video
icc-crawler
imagesiftbot
petalbot
scrapy
timpibot
velenpublicwebcrawler
webzio-extended
facebookexternalhit
img2dataset

Rule Path
Disallow /

*

Rule Path
Disallow /media_proxy/
Disallow /interact/

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file