Robots Exclusion Standard data for my-mooc.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | my-mooc.com |
Base Domain | my-mooc.com |
Scan Status | Ok |
Last Scan | 2024-10-17T04:56:29+00:00 |
Next Scan | 2024-11-16T04:56:29+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-17T04:56:29+00:00 |
URL | https://www.my-mooc.com/robots.txt |
Domain IPs | 108.156.133.127, 108.156.133.68, 108.156.133.85, 108.156.133.92 |
Response IP | 108.156.133.68 |
Found | Yes |
Hash | f5f99f7ffa7a00ae4b3e71e753e0b10aa0bfca316caf864ff985d714bfe14fe5 |
SimHash | a81d7326e593 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /*/review/ |
Disallow | /*/cgu/ |
Disallow | /oauth/connect/ |
Disallow | /*/moocs?filter= |
Disallow | /*/?sort= |
Disallow | /*/?init= |
Disallow | /*/?moocsLimit= |
Disallow | /resource/*/open |
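
Several of the rules above use the `*` wildcard, so a plain prefix comparison is not enough to tell whether a URL is blocked. The sketch below shows one way to test a path (with its query string) against these eight rules, assuming the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end, and every rule is otherwise a prefix match. The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are illustrative only, and the sketch ignores `Allow` precedence because this group has no `Allow` rules.

```python
import re

# The Disallow paths listed for the "*" group in this scan.
DISALLOW = [
    "/*/review/",
    "/*/cgu/",
    "/oauth/connect/",
    "/*/moocs?filter=",
    "/*/?sort=",
    "/*/?init=",
    "/*/?moocsLimit=",
    "/resource/*/open",
]


def rule_to_regex(rule):
    """Translate one robots.txt path rule into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the URL; everything else is literal.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


def is_disallowed(path_and_query, rules=DISALLOW):
    """Return True if any rule matches the start of the path plus query."""
    return any(rule_to_regex(rule).match(path_and_query) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/en/review/", "/en/moocs?filter=free", "/en/moocs", "/?sort=date"]:
        print(candidate, is_disallowed(candidate))
```

Under this interpretation the rules beginning with `/*/` require at least one leading path segment, so `/fr/?sort=date` is matched while a root-level `/?sort=date` is not.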
Other Records
Field | Value |
---|---|
sitemap | https://www.my-mooc.com/sitemap/www_fr.xml |
sitemap | https://www.my-mooc.com/sitemap/www_en.xml |
sitemap | https://www.my-mooc.com/sitemap/www_zh.xml |
sitemap | https://www.my-mooc.com/sitemap/www_ru.xml |
sitemap | https://www.my-mooc.com/sitemap/www_pt.xml |
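
For readers who want to work with these records programmatically, the following sketch rebuilds an approximate robots.txt from the tables in this report and reads the sitemap entries back with Python's standard `urllib.robotparser` (the `site_maps()` method needs Python 3.8 or later). The `ROBOTS_TXT` string is a reconstruction, not the file as served: the exact bytes, ordering, and field casing are assumptions, so it will not necessarily reproduce the hash shown above. The helper name `sitemap_urls` is hypothetical. The standard-library parser does not interpret `*` wildcards inside paths, so it is used here only to extract sitemaps.

```python
from urllib.robotparser import RobotFileParser

# Approximate reconstruction from the scan tables (assumed layout).
ROBOTS_TXT = """\
User-agent: *
Disallow: /*/review/
Disallow: /*/cgu/
Disallow: /oauth/connect/
Disallow: /*/moocs?filter=
Disallow: /*/?sort=
Disallow: /*/?init=
Disallow: /*/?moocsLimit=
Disallow: /resource/*/open

Sitemap: https://www.my-mooc.com/sitemap/www_fr.xml
Sitemap: https://www.my-mooc.com/sitemap/www_en.xml
Sitemap: https://www.my-mooc.com/sitemap/www_zh.xml
Sitemap: https://www.my-mooc.com/sitemap/www_ru.xml
Sitemap: https://www.my-mooc.com/sitemap/www_pt.xml
"""


def sitemap_urls(robots_txt):
    """Return the Sitemap URLs declared in a robots.txt body, in file order."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap lines are present.
    return parser.site_maps() or []


if __name__ == "__main__":
    for url in sitemap_urls(ROBOTS_TXT):
        print(url)
```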