Robots Exclusion Standard data for blogs.cornell.edu
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | blogs.cornell.edu |
Base Domain | cornell.edu |
Scan Status | Ok |
Last Scan | 2024-05-01T16:13:42+00:00 |
Next Scan | 2024-05-31T16:13:42+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-01T16:13:42+00:00 |
URL | https://blogs.cornell.edu/robots.txt |
Domain IPs | 100.24.182.117, 184.72.224.80, 3.91.109.122, 34.199.202.106, 34.227.238.166, 35.172.73.102 |
Response IP | 34.227.238.166 |
Found | Yes |
Hash | 36ce4a7a53c0e83f719620e675beadab5215a463936959ff89fa18dd691b85f6 |
SimHash | e0c73bc2888b |
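The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch under that assumption, a content fingerprint of a fetched robots.txt body could be computed as follows. The function name `content_hash` and the sample body are hypothetical, so the digest it produces will not match the value in the table.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw response body.

    Assumption: the scanner hashes the unmodified bytes of the response.
    The report does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical body for illustration only; not the real file contents.
sample_body = b"Sitemap: https://blogs.cornell.edu/wp-sitemap.xml\n"
digest = content_hash(sample_body)
print(len(digest), digest)
```

Comparing digests between two scans is a cheap way to tell whether the file changed at all. A similarity hash such as the SimHash field is better suited to judging how much it changed.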
Groups
Other Records
Field | Value |
---|---|
sitemap | https://blogs.cornell.edu/wp-sitemap.xml |
Warnings
- 6 invalid lines.
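
The report sorts the file's lines into groups, other records, and invalid lines. The sketch below shows one way such a classification might work. It is not the scanner's actual logic: the rules for what counts as invalid are assumptions (a line with no `field: value` shape, an unrecognized field, or an allow/disallow rule before any user-agent line), and the function name `classify_robots` and the sample text are hypothetical.

```python
def classify_robots(text: str):
    """Split robots.txt text into (groups, other_records, invalid_lines)."""
    groups = []   # each group: {"agents": [...], "rules": [(field, value), ...]}
    other = []    # non-group records such as sitemap
    invalid = []  # (line_number, raw_line) pairs that could not be classified
    current = None

    for number, raw in enumerate(text.splitlines(), start=1):
        # Drop comments and surrounding whitespace; skip blank lines.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue

        # Split on the first colon only, so URL values keep their "://".
        field, sep, value = line.partition(":")
        field = field.strip().lower()
        value = value.strip()

        if not sep or not field:
            invalid.append((number, raw))
        elif field == "user-agent":
            # A user-agent line after rules starts a new group.
            if current is None or current["rules"]:
                current = {"agents": [], "rules": []}
                groups.append(current)
            current["agents"].append(value)
        elif field in ("allow", "disallow"):
            if current is None:
                invalid.append((number, raw))  # rule with no preceding group
            else:
                current["rules"].append((field, value))
        elif field == "sitemap":
            other.append((field, value))
        else:
            invalid.append((number, raw))      # assumption: unknown field

    return groups, other, invalid


# Hypothetical input for illustration only; not the real file contents.
sample = """<html>
Sitemap: https://blogs.cornell.edu/wp-sitemap.xml
User-agent: *
Disallow: /wp-admin/
"""
groups, other, invalid = classify_robots(sample)
print(len(groups), other, len(invalid))
```

With a classification like this, a file that yields no groups, one sitemap record, and several unparseable lines would produce a report shaped like the one above.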