archive.blogs.harvard.edu
robots.txt

Robots Exclusion Standard data for archive.blogs.harvard.edu

Resource Scan

Scan Details

Site Domain archive.blogs.harvard.edu
Base Domain harvard.edu
Scan Status Ok
Last Scan 2024-05-21T01:08:01+00:00
Next Scan 2024-06-20T01:08:01+00:00

Last Scan

Scanned 2024-05-21T01:08:01+00:00
URL https://archive.blogs.harvard.edu/robots.txt
Domain IPs 199.16.172.158, 199.16.173.182
Response IP 199.16.173.182
Found Yes
Hash 8d3884fca4c41f46e92b7330fee42fd91878f5ec8f246b789beae6c28c182b23
SimHash 410188228b92

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
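
The two rules in the * group overlap: /wp-admin/admin-ajax.php matches both the Disallow and the Allow path. A minimal sketch of how a crawler might resolve this follows, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The names RULES and is_allowed are hypothetical and are not part of the scan data, and the sketch handles plain path prefixes only, with no wildcard support.

```python
from typing import List, Tuple

# Rules for the "*" group, copied from the Groups listing above.
RULES: List[Tuple[str, str]] = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path: str, rules: List[Tuple[str, str]] = RULES) -> bool:
    """Return True if `path` may be crawled under `rules`.

    Assumes longest-match precedence: the matching rule with the
    longest path decides, and "allow" wins when lengths are equal.
    A path that matches no rule is allowed by default.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue  # rule does not apply to this path
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for candidate in ("/", "/wp-admin/", "/wp-admin/options.php",
                      "/wp-admin/admin-ajax.php"):
        print(candidate, is_allowed(candidate))
```

Under this assumption, everything below /wp-admin/ is blocked except admin-ajax.php, and all other paths remain crawlable. Parsers that apply the first matching rule in file order instead of the longest match would block admin-ajax.php as well, because the Disallow line comes first.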

Other Records

Field Value
sitemap https://archive.blogs.harvard.edu/sitemap.xml
sitemap https://archive.blogs.harvard.edu/news-sitemap.xml