sites.wustl.edu
robots.txt
Robots Exclusion Standard data for sites.wustl.edu
Resource Scan
Scan Details
Site Domain | sites.wustl.edu |
Base Domain | wustl.edu |
Scan Status | Ok |
Last Scan | 2024-06-01T11:32:08+00:00 |
Next Scan | 2024-07-01T11:32:08+00:00 |
Last Scan
Scanned | 2024-06-01T11:32:08+00:00 |
URL | https://sites.wustl.edu/robots.txt |
Domain IPs | 34.215.37.29, 34.216.237.15 |
Response IP | 34.215.37.29 |
Found | Yes |
Hash | 82ef4d95d5d450c41bcb8b88537a8a243e0e90a588906b951b27c1e63fe11e58 |
SimHash | e0c45ec1892b |
Groups
Other Records
Field | Value |
---|---|
sitemap | https://sites.wustl.edu/wp-sitemap.xml |
Warnings
- 6 invalid lines.