learningforsustainability.net
robots.txt

Robots Exclusion Standard data for learningforsustainability.net

Resource Scan

Scan Details

Site Domain learningforsustainability.net
Base Domain learningforsustainability.net
Scan Status Ok
Last Scan2025-09-17T11:29:48+00:00
Next Scan 2025-10-17T11:29:48+00:00

Last Scan

Scanned2025-09-17T11:29:48+00:00
URL https://learningforsustainability.net/robots.txt
Domain IPs 104.21.40.68, 172.67.179.228, 2606:4700:3030::ac43:b3e4, 2606:4700:3036::6815:2844
Response IP 172.67.179.228
Found Yes
Hash bf0f0db6d7ad9e891546a7d9b727764e7c85f8e296c2bb243d47494aac15584c
SimHash 001daea0a393

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://learningforsustainability.net/wp-sitemap.xml

Comments

  • Block known aggressive bots (SEO scrapers, AI crawlers)
  • Allow major search engines (Google, Bing, etc.)