iea-shc.org
robots.txt

Robots Exclusion Standard data for iea-shc.org

Resource Scan

Scan Details

Site Domain iea-shc.org
Base Domain iea-shc.org
Scan Status Ok
Last Scan2025-05-19T03:53:36+00:00
Next Scan 2025-06-18T03:53:36+00:00

Last Scan

Scanned2025-05-19T03:53:36+00:00
URL https://iea-shc.org/robots.txt
Domain IPs 199.115.204.214
Response IP 199.115.204.214
Found Yes
Hash 77bd4e721135c9145fa35e59154c662fad9f35a5565200ea98c94c19cc27a736
SimHash e18a45846192

Groups

*

Rule Path
Disallow /CaptchaImage.ashx*
Disallow /Admin/
Disallow /App_Browsers/
Disallow /App_Code/
Disallow /App_Data/
Disallow /App_Themes/
Disallow /bin/
Disallow /Blog/ViewCategory.aspx$
Disallow /Blog/ViewArchive.aspx$
Disallow /Data/SiteImages/emoticons
Disallow /MyPage.aspx
Disallow /MyPage.aspx$
Disallow /MyPage.aspx*
Disallow /NeatHtml/
Disallow /NeatUpload/
Disallow /nofollow/
Disallow /nf/
Disallow /Secure/
Disallow /Services/TinyMCETemplates.ashx$
Disallow /SearchResults.aspx$
Disallow /SearchResults.aspx*
Disallow /SiteMap.aspx
Disallow /SiteOffice/
Disallow /Setup/
Disallow /WebStore/CartAdd.aspx$
Disallow /WebStore/CartAdd.aspx*
Disallow /WebStore/Cart.aspx$
Disallow /WebStore/Cart.aspx*
Disallow /Error.htm
Disallow /industry-workshop-2019-05

Comments

  • robots.txt