training.simplicable.com
robots.txt

Robots Exclusion Standard data for training.simplicable.com

Resource Scan

Scan Details

Site Domain training.simplicable.com
Base Domain simplicable.com
Scan Status Ok
Last Scan2025-12-12T18:27:21+00:00
Next Scan 2026-01-11T18:27:21+00:00

Last Scan

Scanned2025-12-12T18:27:21+00:00
URL https://training.simplicable.com/robots.txt
Domain IPs 104.26.2.94, 104.26.3.94, 172.67.72.115, 2606:4700:20::681a:25e, 2606:4700:20::681a:35e, 2606:4700:20::ac43:4873
Response IP 104.26.3.94
Found Yes
Hash e46017a9258c526b2f7a7fad3d97398c1edd6be8e5f60d4f2b8ce9028befb164
SimHash 2114db83e701

Groups

ia_archiver

Rule Path
Disallow /

tineye

Rule Path
Disallow /

googlebot-image

Rule Path
Allow /images/favicon.ico
Disallow /

*

Rule Path
Disallow /

googlebot

Rule Path
Allow /
Disallow /*.text$

mediapartners-google

Rule Path
Allow /

slurp

Rule Path
Allow /
Disallow /images/
Disallow /*.text$
Disallow /*.jpg$
Disallow /*.jpeg$
Disallow /*.gif$

bingbot

Rule Path
Allow /
Disallow /images/
Disallow /*.text$
Disallow /*.jpg$
Disallow /*.jpeg$
Disallow /*.gif$

duckduckbot

Rule Path
Allow /
Disallow /images/
Disallow /*.text$
Disallow /*.jpg$
Disallow /*.jpeg$
Disallow /*.gif$