napaba.org
robots.txt

Robots Exclusion Standard data for napaba.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	napaba.org
Base Domain	napaba.org
Scan Status	Ok
Last Scan	2025-07-25T00:38:27+00:00
Next Scan	2025-08-24T00:38:27+00:00

Last Scan

Scanned	2025-07-25T00:38:27+00:00
URL	https://www.napaba.org/robots.txt
Domain IPs	35.169.50.49, 35.173.82.140, 35.174.132.21
Response IP	35.174.132.21
Found	Yes
Hash	6aa2fa2aa1f04150e7544f70618afc8a7be517cc99755226dc1a1d6ee66fe845
SimHash	edd4dd42c1d8

Groups

*

Rule	Path
Disallow	/person
Disallow	/member
Disallow	/members

Rule

Path

Disallow

/person

Disallow

/member

Disallow

/members

*

Rule	Path
Disallow	/global_inc/
Allow	/global_inc/*.css
Allow	/global_inc/*.js

Rule

Path

Disallow

/global_inc/

Allow

/global_inc/*.css

Allow

/global_inc/*.js

*

Rule	Path
Disallow	/global_engine/ajax/

Rule

Path

Disallow

/global_engine/ajax/

Back to top

Other Records

Field	Value
sitemap	http://www.napaba.org/autositemapindex.xml

Field

Value

sitemap

http://www.napaba.org/autositemapindex.xml

Back to top

Comments

Set by Tech Impact to prevent member directory from being indexed
When crawlers hit the engine dir they sometimes publish confusing links to site content
in their search results so we exclude these specific engines from crawling it.
Note: Certain crawlers do need access to this directory so we do not want a blanket
exlude statment here.

Back to top

Warnings

18 invalid lines.

Back to top

napaba.orgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

*

*

Other Records

Comments

Warnings

napaba.org
robots.txt