waset.org
robots.txt

Robots Exclusion Standard data for waset.org

Resource Scan

Scan Details

Site Domain waset.org
Base Domain waset.org
Scan Status Ok
Last Scan 2024-04-30T11:41:39+00:00
Next Scan 2024-05-30T11:41:39+00:00
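
The Next Scan time is exactly 30 days after the Last Scan time. A minimal sketch of that arithmetic, assuming a fixed 30-day rescan interval (the interval is inferred from the two timestamps and is not stated by the scanner; RESCAN_INTERVAL is a hypothetical name):

```python
from datetime import datetime, timedelta

# Timestamps copied from the scan details.
last_scan = datetime.fromisoformat("2024-04-30T11:41:39+00:00")
next_scan = datetime.fromisoformat("2024-05-30T11:41:39+00:00")

# Assumed rescan interval, inferred from the two values above.
RESCAN_INTERVAL = timedelta(days=30)

# Difference between the two reported timestamps.
print(next_scan - last_scan)

# Check that the assumed interval reproduces the reported Next Scan time.
print(last_scan + RESCAN_INTERVAL == next_scan)
```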

Last Scan

Scanned 2024-04-30T11:41:39+00:00
URL https://waset.org/robots.txt
Domain IPs 104.21.90.217, 172.67.205.138, 2606:4700:3031::6815:5ad9, 2606:4700:3032::ac43:cd8a
Response IP 172.67.205.138
Found Yes
Hash a1fef2cd86996f39275e83f88da8b306c7f8cb2430afd5bbb81c643a42f6c0ed
SimHash 491d0051c7d1
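
The Hash value is 64 hexadecimal characters, which matches the length of a SHA-256 digest. A sketch of how such a fingerprint could be computed, assuming SHA-256 over the raw response bytes (the scanner does not say which algorithm or input it uses, and content_hash is a hypothetical helper):

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of raw robots.txt bytes (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Illustrative input only; this is not the actual waset.org file content.
sample = b"User-agent: *\nDisallow:\n"
digest = content_hash(sample)

# A SHA-256 hex digest is always 64 characters, the same length as the Hash field.
print(len(digest))
```

Comparing the digest of a fresh fetch with the stored Hash would show whether the file changed between scans, provided the algorithm assumption holds.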

Groups

googlebot

Rule Path
Allow /

*

Rule Path
Disallow

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://waset.org/sitemaps/index.xml
sitemap https://publications.waset.org/sitemaps/index_publications.xml
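
The groups and sitemap records above can be checked with Python's standard urllib.robotparser module. This is a sketch under stated assumptions: the robots.txt text below is reconstructed from the listed rules, so directive order, spacing, and capitalization are guesses rather than the file as served, and the bot name SomeOtherBot and the /conferences path are hypothetical. An empty Disallow value disallows nothing, so the * group permits every path.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan listing; not a verbatim copy of the served file.
ROBOTS_TXT = """\
User-agent: googlebot
Allow: /

User-agent: *
Disallow:

User-agent: ia_archiver
Disallow: /

Sitemap: https://waset.org/sitemaps/index.xml
Sitemap: https://publications.waset.org/sitemaps/index_publications.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

url = "https://waset.org/conferences"  # hypothetical path

# googlebot has its own group, SomeOtherBot falls back to the * group,
# and ia_archiver is blocked from the whole site.
for agent in ("googlebot", "SomeOtherBot", "ia_archiver"):
    print(agent, parser.can_fetch(agent, url))

# Sitemap records are independent of any user-agent group.
print(parser.site_maps())
```

Under this reconstruction, only ia_archiver is refused. Note that site_maps() requires Python 3.8 or later.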