inst.org
robots.txt

Robots Exclusion Standard data for inst.org

Resource Scan

Scan Details

Site Domain inst.org
Base Domain inst.org
Scan Status Ok
Last Scan2025-08-23T01:53:06+00:00
Next Scan 2025-09-22T01:53:06+00:00

Last Scan

Scanned2025-08-23T01:53:06+00:00
URL https://inst.org/robots.txt
Redirect https://inst.org/robots.txt?v=7885444af42e
Domain IPs 104.21.54.124, 172.67.138.136, 2606:4700:3030::ac43:8a88, 2606:4700:3033::6815:367c
Response IP 172.67.138.136
Found Yes
Hash 8c0e9744684def00cd6da09068406436563b2fe8d4fcaccd5914aff49357dd67
SimHash cb01aa22cb93

Groups

*

Rule Path
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-content/uploads/wp-import-export-lite/

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://inst.org/sitemap.xml
sitemap https://inst.org/sitemap.rss