articulateinitiative.org
robots.txt

Robots Exclusion Standard data for articulateinitiative.org

Resource Scan

Scan Details

Site Domain articulateinitiative.org
Base Domain articulateinitiative.org
Scan Status Ok
Last Scan 2025-09-28T20:37:16+00:00
Next Scan 2025-10-05T20:37:16+00:00

Last Scan

Scanned 2025-09-28T20:37:16+00:00
URL https://articulateinitiative.org/robots.txt
Domain IPs 104.21.73.30, 172.67.157.54, 2606:4700:3031::ac43:9d36, 2606:4700:3037::6815:491e
Response IP 172.67.157.54
Found Yes
Hash 7a8de1020c715c082f38915f05cbe01c7bd4b14631795546c5dda68c3171157a
SimHash 401c95408512

Groups

googlebot

Rule Path
Disallow /info/
Disallow /search/

mediapartners-google

Rule Path
Disallow /info/
Disallow /search/

yahoo! slurp

Rule Path
Allow /$
Disallow /

bingbot

Rule Path
Allow /$
Disallow /

yandex

Rule Path
Allow /$
Disallow /

baiduspider

Rule Path
Disallow /

sogou

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow

ips-agent

Rule Path
Disallow /parking.php4

blexbot

Rule Path
Disallow /

pandalytics

Rule Path
Disallow /info/
Disallow /search/

ioncrawl

Rule Path
Disallow /info/
Disallow /search/

rogerbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

bytedance

Rule Path
Disallow /

bytespider
baiduspider

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

oncrawl

Rule Path
Disallow /

botify

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

*

Rule Path
Disallow /
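
Example: evaluating the rules above

The groups listed above can be checked against a URL path with a small matcher. The following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes the matching convention described in RFC 9309 (longest matching pattern wins, Allow wins a tie, `*` is a wildcard, a trailing `$` anchors the end of the path). The function names `pattern_to_regex` and `is_allowed` are hypothetical, and the rule lists are copied by hand from the bingbot and googlebot groups in this report. Individual crawlers may interpret the rules differently.

```python
import re


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, url_path):
    """Decide whether url_path may be fetched under one group's rules.

    rules is a list of (directive, path) tuples. The longest matching
    pattern wins; on a tie, Allow wins. If nothing matches, the path
    is allowed.
    """
    best_len, allowed = -1, True
    for directive, path in rules:
        if not path:  # an empty path matches nothing
            continue
        if pattern_to_regex(path).match(url_path):
            is_allow = directive.lower() == "allow"
            if len(path) > best_len or (len(path) == best_len and is_allow):
                best_len, allowed = len(path), is_allow
    return allowed


# Rule lists transcribed from the groups in this report.
bingbot = [("Allow", "/$"), ("Disallow", "/")]
googlebot = [("Disallow", "/info/"), ("Disallow", "/search/")]

print(is_allowed(bingbot, "/"))             # home page only
print(is_allowed(bingbot, "/about"))        # everything else
print(is_allowed(googlebot, "/info/page"))  # blocked prefix
print(is_allowed(googlebot, "/about"))      # no matching rule
```

Under these assumptions, the `Allow /$` plus `Disallow /` pair used for yahoo! slurp, bingbot, and yandex permits only the site root, while the googlebot group blocks only paths under `/info/` and `/search/`.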