Robots Exclusion Standard data for clipart.com

Resource Scan

Scan Details

Site Domain clipart.com
Base Domain clipart.com
Scan Status Ok
Last Scan 2024-06-28T14:08:28+00:00
Next Scan 2024-07-05T14:08:28+00:00

Last Scan

Scanned 2024-06-28T14:08:28+00:00
URL https://clipart.com/robots.txt
Domain IPs 104.26.2.71, 104.26.3.71, 172.67.69.192, 2606:4700:20::681a:247, 2606:4700:20::681a:347, 2606:4700:20::ac43:45c0
Response IP 104.26.3.71
Found Yes
Hash 6527f02d65f8f57533f4245c5a0ac53f6761aeefbd45edf1d42fd6023f1f5f46
SimHash 0a28f8a0ea02
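
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the report does not say which algorithm the scanner uses or exactly which bytes it hashes. As a minimal sketch under that assumption, a fetched copy of the file could be compared against the recorded value like this (the helper names `sha256_hex` and `matches_recorded` are hypothetical):

```python
import hashlib

# Hash value copied from the scan details above.
RECORDED_HASH = "6527f02d65f8f57533f4245c5a0ac53f6761aeefbd45edf1d42fd6023f1f5f46"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value.

    ASSUMPTION: the scanner hashes the unmodified response body with
    SHA-256. If it normalizes the text first or uses another algorithm,
    a mismatch here does not mean the file has changed.
    """
    return sha256_hex(body) == recorded.lower()
```

A mismatch is therefore only a hint that the file may have changed since the scan, not proof of it.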

Groups

*

Rule Path
Disallow /activation/
Disallow /dover/
Disallow /imsi/
Disallow /signcut/
Disallow /thinkedu/

lwp-trivial

Rule Path
Disallow /

lwp::simple

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /
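
The groups above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is reconstructed from the groups listed in this report, so its exact layout, ordering, and any comments in the live file are assumptions, and the `allowed` helper is a hypothetical name introduced for illustration.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the groups in this report, not copied
# from the live file at https://clipart.com/robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /activation/
Disallow: /dover/
Disallow: /imsi/
Disallow: /signcut/
Disallow: /thinkedu/

User-agent: lwp-trivial
Disallow: /

User-agent: lwp::simple
Disallow: /

User-agent: semrushbot-sa
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: bubing
Disallow: /
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://clipart.com" + path)
```

With these rules, a crawler that is not named in any group falls under `*` and is blocked only from the five listed directories, while each named crawler is blocked from the whole site. For example, `allowed("Googlebot", "/dover/index.html")` and `allowed("AhrefsBot", "/")` are both false, and `allowed("Googlebot", "/")` is true. Note that `urllib.robotparser` matches user-agent names case-insensitively by substring, which may differ from how a given crawler interprets the file.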