artworkarchive.com
robots.txt

Robots Exclusion Standard data for artworkarchive.com

Resource Scan

Scan Details

Site Domain artworkarchive.com
Base Domain artworkarchive.com
Scan Status Ok
Last Scan 2026-01-24T10:29:46+00:00
Next Scan 2026-02-23T10:29:46+00:00

Last Scan

Scanned 2026-01-24T10:29:46+00:00
URL https://artworkarchive.com/robots.txt
Domain IPs 104.26.14.145, 104.26.15.145, 172.67.71.186, 2606:4700:20::681a:e91, 2606:4700:20::681a:f91, 2606:4700:20::ac43:47ba
Response IP 104.26.15.145
Found Yes
Hash d51956a259e26133ce342f396d04deb1c4100c948fac238485692ba9233be291
SimHash 5a3fff10c890
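
A rough illustration of how the Hash value could be used to detect changes between scans. The 64 hex characters are consistent with a SHA-256 digest, but the report does not name the algorithm or say which bytes were hashed, so both are assumptions here. The helper names body_digest and unchanged_since_scan are hypothetical.

```python
import hashlib

# Value copied from the "Hash" row of the scan above.
RECORDED_HASH = "d51956a259e26133ce342f396d04deb1c4100c948fac238485692ba9233be291"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw response body with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def unchanged_since_scan(body: bytes) -> bool:
    """True if the body hashes to the recorded value (under the assumption above)."""
    return body_digest(body) == RECORDED_HASH
```

If the assumption holds, passing the bytes of a freshly fetched robots.txt to unchanged_since_scan would indicate whether the file has changed since the 2026-01-24 scan. A mismatch could also simply mean the scanner hashes something else, such as a normalized form of the file.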

Groups

*

Rule Path
Disallow /from/
Disallow /admin/
Disallow /discovery/artwork/
Disallow /inbound_message/

gptbot

Rule Path
Disallow /discovery/*
Disallow /profile/*
Disallow /rooms/*
Disallow /room_viewer_session/*

chatgpt-user

Rule Path
Disallow /discovery/*
Disallow /profile/*

perplexitybot

Rule Path
Disallow /discovery/*
Disallow /profile/*

ccbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /
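
The groups above can be read as a lookup: a crawler uses the group that names it, and falls back to the * group otherwise. The following is a minimal sketch of that lookup, assuming the path-matching conventions of RFC 9309 (prefix match, * as a wildcard, a trailing $ as an end anchor) and a simplified exact, case-insensitive match on the user-agent name. GROUPS, rule_to_regex, and is_allowed are hypothetical names, and the example paths are made up for illustration.

```python
import re

# Groups transcribed from the scan above. Crawlers that are blocked
# entirely ("Disallow /") are collected in one tuple for brevity.
FULLY_BLOCKED = (
    "ccbot", "blexbot", "mj12bot", "twengabot", "ahrefsbot", "semrushbot",
    "wotbox", "sosospider", "baiduspider", "zumbot", "yandexbot",
)

GROUPS = {
    "*": ["/from/", "/admin/", "/discovery/artwork/", "/inbound_message/"],
    "gptbot": ["/discovery/*", "/profile/*", "/rooms/*", "/room_viewer_session/*"],
    "chatgpt-user": ["/discovery/*", "/profile/*"],
    "perplexitybot": ["/discovery/*", "/profile/*"],
    **{name: ["/"] for name in FULLY_BLOCKED},
}


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule into a regex anchored at the path start."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape literal text and turn each "*" into "match anything".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_allowed(user_agent: str, path: str) -> bool:
    """Return False if any Disallow rule in the applicable group matches path.

    Simplification: the group is chosen by exact lowercase name, and only
    Disallow rules are modeled because the scan lists no Allow rules.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return not any(rule_to_regex(rule).match(path) for rule in rules)
```

Under these assumptions, is_allowed("GPTBot", "/profile/example") is False while is_allowed("Googlebot", "/profile/example") is True, because a crawler with its own group does not inherit the rules of the * group. For the same reason, is_allowed("GPTBot", "/admin/") is True in this sketch even though /admin/ is disallowed for unnamed crawlers.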

Warnings

  • 2 invalid lines.