mooc.org
robots.txt
Robots Exclusion Standard data for mooc.org
Resource Scan
Scan Details
Site Domain | mooc.org |
Base Domain | mooc.org |
Scan Status | Ok |
Last Scan | 2024-10-22T05:30:46+00:00 |
Next Scan | 2024-11-21T05:30:46+00:00 |
Last Scan
Scanned | 2024-10-22T05:30:46+00:00 |
URL | https://mooc.org/robots.txt |
Redirect | https://www.mooc.org/robots.txt |
Redirect Domain | www.mooc.org |
Redirect Base | mooc.org |
Domain IPs | 13.33.30.11, 13.33.30.120, 13.33.30.6, 13.33.30.61 |
Redirect IPs | 199.60.103.225, 199.60.103.31, 2606:2c40::c73c:671f, 2606:2c40::c73c:67e1 |
Response IP | 199.60.103.31 |
Found | Yes |
Hash | f7dad4b34a1c496bc7b6ab42bbf1d85a45d8280aa0ed34f8128d122d08c82085 |
SimHash | b8e9c530e191 |
Groups
*
Rule | Path |
---|---|
Disallow | /blog/tag/ |
Disallow | /blog/tag/* |
Disallow | /blog/author/ |
Disallow | /blog/author/* |
Disallow | /_hcms/preview/ |
Disallow | /hs/manage-preferences/ |
Disallow | /hs/preferences-center/ |
Disallow | /*?*hs_preview=* |
Disallow | /*?*hsCacheBuster=* |