jta.org
robots.txt

Robots Exclusion Standard data for jta.org

Resource Scan

Scan Details

Site Domain	jta.org
Base Domain	jta.org
Scan Status	Ok
Last Scan	2024-11-12T03:48:25+00:00
Next Scan	2024-11-19T03:48:25+00:00
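
The Last Scan and Next Scan timestamps above are exactly seven days apart. As a minimal sketch, assuming the scanner simply adds a fixed weekly interval (the interval is inferred from these two values, not stated by the report), the next scan time can be derived like this:

```python
from datetime import datetime, timedelta

# Timestamp copied from the "Last Scan" field of the report.
last_scan = datetime.fromisoformat("2024-11-12T03:48:25+00:00")

# Assumption: a fixed 7-day rescan interval, inferred from the two timestamps.
RESCAN_INTERVAL = timedelta(days=7)

next_scan = last_scan + RESCAN_INTERVAL
next_scan_text = next_scan.isoformat()
print(next_scan_text)
```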

Last Scan

Scanned	2024-11-12T03:48:25+00:00
URL	https://jta.org/robots.txt
Domain IPs	151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133, 2a04:4e42:200::645, 2a04:4e42:400::645, 2a04:4e42:600::645, 2a04:4e42::645
Response IP	151.101.194.133
Found	Yes
Hash	7729892591266f879aeef7e5698bd35f3cd7934f1109fab01761b86901991c4e
SimHash	4b01dc40cb11
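
The Hash value above is 64 hexadecimal digits long, which matches the length of a SHA-256 digest. The report does not name the algorithm or say exactly which bytes are hashed, so the following is only a sketch under that assumption; `content_hash` is a hypothetical helper name, and the result is not guaranteed to match the value in the report.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


# Comparing digests from two scans is a cheap way to detect a changed file.
previous = content_hash(b"User-agent: *\nDisallow: /wp-admin/\n")
current = content_hash(b"User-agent: *\nDisallow: /wp-admin/\nDisallow: /jfed\n")
changed = previous != current
print(changed)
```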

Groups

*

Rule	Path
Disallow	/wp-admin/
Disallow	/jfed

petalbot

Rule	Path
Disallow	/

maxpoint

Rule	Path
Disallow	/

maxpointcrawler

Rule	Path
Disallow	/

maxpoint bot

Rule	Path
Disallow	/

nutch

Rule	Path
Disallow	/

msnbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	1

Other Records

Field	Value
sitemap	https://www.jta.org/sitemap_index.xml
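
To illustrate how a crawler might interpret the groups and records listed in this report, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction from the scan data, not the verbatim file: the real robots.txt may order or combine groups differently, and attaching `Crawl-delay: 1` to the `msnbot` group is an assumption based on where that record appears in the report. The agent name `examplebot` is hypothetical and stands for any crawler that falls under the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction from the scan's groups; not the verbatim file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /jfed

User-agent: petalbot
Disallow: /

User-agent: maxpoint
Disallow: /

User-agent: maxpointcrawler
Disallow: /

User-agent: maxpoint bot
Disallow: /

User-agent: nutch
Disallow: /

User-agent: msnbot
Crawl-delay: 1

Sitemap: https://www.jta.org/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "examplebot" is a hypothetical crawler covered only by the * group.
generic_admin = parser.can_fetch("examplebot", "https://jta.org/wp-admin/")
generic_jfed = parser.can_fetch("examplebot", "https://jta.org/jfed/page")
generic_news = parser.can_fetch("examplebot", "https://jta.org/news")

# Agents with "Disallow: /" are blocked from every path.
petalbot_home = parser.can_fetch("petalbot", "https://jta.org/")
nutch_home = parser.can_fetch("nutch", "https://jta.org/")

# msnbot has its own group with no path rules, so this parser allows all paths.
msnbot_admin = parser.can_fetch("msnbot", "https://jta.org/wp-admin/")
msnbot_delay = parser.crawl_delay("msnbot")

# site_maps() requires Python 3.8 or later.
sitemaps = parser.site_maps()

print(generic_admin, generic_jfed, generic_news)
print(petalbot_home, nutch_home, msnbot_admin, msnbot_delay)
print(sitemaps)
```

Because `msnbot` matches its own group, it is not subject to the `*` group's `Disallow` rules in this parser, which is consistent with the report's "No rules defined. All paths allowed." note. Other parsers may resolve groups differently.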
