learningforsustainability.net
robots.txt

Robots Exclusion Standard data for learningforsustainability.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	learningforsustainability.net
Base Domain	learningforsustainability.net
Scan Status	Ok
Last Scan	2025-09-17T11:29:48+00:00
Next Scan	2025-10-17T11:29:48+00:00

Last Scan

Scanned	2025-09-17T11:29:48+00:00
URL	https://learningforsustainability.net/robots.txt
Domain IPs	104.21.40.68, 172.67.179.228, 2606:4700:3030::ac43:b3e4, 2606:4700:3036::6815:2844
Response IP	172.67.179.228
Found	Yes
Hash	bf0f0db6d7ad9e891546a7d9b727764e7c85f8e296c2bb243d47494aac15584c
SimHash	001daea0a393

Groups

*

Rule	Path
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wp-admin/

Allow

/wp-admin/admin-ajax.php

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

yandexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://learningforsustainability.net/wp-sitemap.xml

Field

Value

sitemap

https://learningforsustainability.net/wp-sitemap.xml

Back to top

Comments

Block known aggressive bots (SEO scrapers, AI crawlers)
Allow major search engines (Google, Bing, etc.)

Back to top

learningforsustainability.netrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

ahrefsbot

semrushbot

petalbot

yandexbot

gptbot

ccbot

Other Records

Comments

learningforsustainability.net
robots.txt