Robots Exclusion Standard data for rafiusa.org

Resource Scan

Scan Details

Site Domain rafiusa.org
Base Domain rafiusa.org
Scan Status Ok
Last Scan 2025-09-25T15:45:33+00:00
Next Scan 2025-10-25T15:45:33+00:00

Last Scan

Scanned 2025-09-25T15:45:33+00:00
URL https://rafiusa.org/robots.txt
Domain IPs 104.21.60.7, 172.67.186.211, 2606:4700:3034::ac43:bad3, 2606:4700:3036::6815:3c07
Response IP 104.21.60.7
Found Yes
Hash 410fa088ac785aa5ddd2ed7b7b6ba0f0511dd20eecf463e91ca5b577311c4133
SimHash a7ce0d10a003

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content
Disallow /piedmont-confirmation/
Disallow /registration-confirmation/
Disallow /tablerequest/
Disallow /success/
Disallow /tableconf/
Disallow /page/
Disallow /cdn-cgi/
Disallow /calendar/action~posterboard/
Disallow /calendar/action~agenda/
Disallow /calendar/action~oneday/
Disallow /calendar/action~month/
Disallow /calendar/action~week/
Disallow /calendar/action~stream/
Disallow /blog/tag/
Disallow /blog/category/
Disallow *?share=*
Disallow /thanks-for-subscribing/
Disallow /thanks-for-signing-up/
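
To show how the rules listed for the `*` group might be applied, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_LINES` list is rebuilt from a subset of the rule table, so its exact layout is an assumption rather than the raw file. The agent name `ExampleBot` and the helper `is_allowed` are hypothetical. The wildcard rule `*?share=*` is left out because the standard-library parser only matches plain path prefixes.

```python
from urllib.robotparser import RobotFileParser

# Rebuilt from a subset of the rule table; the raw file layout is an assumption.
# The wildcard rule (*?share=*) is omitted: this parser matches path prefixes only.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /cgi-bin",
    "Disallow: /wp-admin",
    "Disallow: /wp-includes",
    "Disallow: /wp-content",
    "Disallow: /page/",
    "Disallow: /cdn-cgi/",
    "Disallow: /blog/tag/",
    "Disallow: /blog/category/",
    "Disallow: /thanks-for-subscribing/",
    "Disallow: /thanks-for-signing-up/",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def is_allowed(path, agent="ExampleBot"):
    """Return True if the rules above permit `agent` to fetch `path`."""
    return parser.can_fetch(agent, "https://rafiusa.org" + path)


if __name__ == "__main__":
    for path in ("/", "/blog/some-post", "/wp-admin/options.php", "/blog/tag/farming/"):
        print(path, is_allowed(path))
```

Because matching is by prefix, a rule without a trailing slash such as `/wp-admin` also covers every path beneath it.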

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.rafiusa.org/sitemap_index.xml
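
The `crawl-delay` and `sitemap` records can be read with the same standard-library parser. The sketch below is illustrative only: `ROBOTS_LINES` is a shortened reconstruction (an assumption about the raw file layout), `ExampleBot` is a hypothetical agent name, and `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of the file; the raw layout is an assumption.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /wp-admin",
    "Crawl-delay: 10",
    "",
    "Sitemap: https://www.rafiusa.org/sitemap_index.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Seconds a crawler is asked to wait between requests (None if unset).
delay = parser.crawl_delay("ExampleBot")

# List of sitemap URLs declared in the file (None if there are none).
sitemaps = parser.site_maps()

# Start offsets, in seconds, for three consecutive polite requests.
offsets = [i * delay for i in range(3)]

if __name__ == "__main__":
    print(delay, sitemaps, offsets)
```

A crawler honoring these records would space its requests by the reported delay and could use the sitemap URL as its starting point for discovery.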