inas.org
robots.txt

Robots Exclusion Standard data for inas.org

Resource Scan

Scan Details

Site Domain inas.org
Base Domain inas.org
Scan Status Ok
Last Scan2026-03-19T10:26:28+00:00
Next Scan 2026-04-02T10:26:28+00:00

Last Scan

Scanned2026-03-19T10:26:28+00:00
URL https://inas.org/robots.txt
Domain IPs 104.18.8.177, 104.18.9.177, 2606:4700::6812:8b1, 2606:4700::6812:9b1
Response IP 104.18.8.177
Found Yes
Hash e671cd3b215cd855e5725d9d301ca652f5aa3239afc8a72b277a782d224abb1d
SimHash 6908d8040bb2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /author/
Disallow /*/trackback
Disallow /tag/
Disallow /*/feed
Disallow /?s=*
Disallow /attachment/
Disallow /*?utm_source
Disallow /*%26utm_source

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://inas.org/sitemap.xml

Comments

  • Allow Facebook scraper