futurelearn.com
robots.txt
Robots Exclusion Standard data for futurelearn.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | futurelearn.com |
Base Domain | futurelearn.com |
Scan Status | Ok |
Last Scan | 2024-11-14T08:38:12+00:00 |
Next Scan | 2024-11-28T08:38:12+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-14T08:38:12+00:00 |
URL | https://futurelearn.com/robots.txt |
Redirect | https://assets.futurelearn.com:443/robots.txt |
Redirect Domain | assets.futurelearn.com |
Redirect Base | futurelearn.com |
Domain IPs | 104.18.28.94, 104.18.29.94 |
Redirect IPs | 104.18.28.94, 104.18.29.94 |
Response IP | 104.18.28.94 |
Found | Yes |
Hash | ef17d3d8b25b07e6d10d7afb9fbf2bf3e3010a9c9bd4a0fec45e0b8c0963d992 |
SimHash | 29044750d191 |
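The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. As a minimal sketch under that assumption, a fetched robots.txt body could be compared against the recorded value to detect changes between scans. The helper names `body_digest` and `has_changed` are hypothetical and not part of the scan tool.

```python
import hashlib

# Hash value copied from the scan table above.
RECORDED_HASH = "ef17d3d8b25b07e6d10d7afb9fbf2bf3e3010a9c9bd4a0fec45e0b8c0963d992"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw response
    body. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True when the digest of `body` differs from the recorded hash."""
    return body_digest(body).lower() != recorded.lower()
```

If the assumption holds, `has_changed(fetched_bytes)` returning False would indicate the file is byte-identical to the one seen at the last scan.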
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /info/wp-admin/ |
Disallow | /info/wp-includes/ |
Disallow | /info/wp-json/ |
Disallow | /info/xmlrpc.php |
Disallow | /comments/ |
Disallow | /profiles/ |
Allow | /info/wp-includes/js/jquery/jquery.js |
Allow | /info/wp-includes/js/jquery/jquery-migrate.min.js |
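The two Allow rows overlap with the Disallow row for /info/wp-includes/, so the outcome for those paths depends on how a crawler resolves conflicts. The sketch below applies longest-match precedence as described in RFC 9309 (the matching rule with the longest path wins, a tie goes to Allow, and no match means allowed). It is a simplification: it does plain prefix matching only, with no handling of `*` or `$` wildcards or percent-encoding, which the rules in this group do not use. The names `RULES` and `is_allowed` are hypothetical.

```python
# Rules transcribed from the "*" group in the table above.
RULES = [
    ("disallow", "/info/wp-admin/"),
    ("disallow", "/info/wp-includes/"),
    ("disallow", "/info/wp-json/"),
    ("disallow", "/info/xmlrpc.php"),
    ("disallow", "/comments/"),
    ("disallow", "/profiles/"),
    ("allow", "/info/wp-includes/js/jquery/jquery.js"),
    ("allow", "/info/wp-includes/js/jquery/jquery-migrate.min.js"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Decide whether `path` may be crawled using longest-match precedence.

    Simplified: prefix matching only, no wildcard or encoding handling.
    """
    best_len = -1    # length of the most specific matching rule so far
    allowed = True   # default when no rule matches
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in (
        "/info/wp-includes/js/jquery/jquery.js",
        "/info/wp-includes/css/style.css",
        "/profiles/123",
        "/courses",
    ):
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Parsers that use first-match ordering instead, such as Python's `urllib.robotparser`, would report the jQuery paths as disallowed because the broader Disallow row appears first, so the result depends on the crawler's precedence model.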
Other Records
Field | Value |
---|---|
sitemap | https://www.futurelearn.com/sitemap.xml |