/.well-known/

Log In Sign Up

ashs.org
robots.txt

Robots Exclusion Standard data for ashs.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ashs.org
Base Domain	ashs.org
Scan Status	Ok
Last Scan	4/25/2025, 6:42:15 AM
Next Scan	5/25/2025, 6:42:15 AM

Last Scan

Scanned	4/25/2025, 6:42:15 AM
URL	https://ashs.org/robots.txt
Domain IPs	35.169.50.49, 35.173.82.140, 35.174.132.21
Response IP	35.169.50.49
Found	Yes
Hash	e5b3e193363f132425b1d0192c5d8e7ddc0f29157302f0617301e885abdb5535
SimHash	ec949d42c3d8

Groups

*

Rule

Path

Disallow

/global_inc/

Allow

/global_inc/*.css

Allow

/global_inc/*.js

*

Rule

Path

Disallow

/global_engine/ajax/

Back to top

Other Records

Field

Value

sitemap

https://ashs.org/autositemapindex.xml

Back to top

Comments

When crawlers hit the engine dir they sometimes publish confusing links to site content
in their search results so we exclude these specific engines from crawling it.
Note: Certain crawlers do need access to this directory so we do not want a blanket
exlude statment here.

Back to top

Warnings

18 invalid lines.

Back to top