in.coursera.org
robots.txt

Robots Exclusion Standard data for in.coursera.org

Resource Scan

Scan Details

Site Domain in.coursera.org
Base Domain coursera.org
Scan Status Ok
Last Scan 2024-06-17T11:40:17+00:00
Next Scan 2024-07-01T11:40:17+00:00

Last Scan

Scanned 2024-06-17T11:40:17+00:00
URL https://in.coursera.org/robots.txt
Redirect https://www.coursera.org/robots.txt
Redirect Domain www.coursera.org
Redirect Base coursera.org
Domain IPs 13.33.88.119, 13.33.88.124, 13.33.88.33, 13.33.88.53
Redirect IPs 13.33.88.119, 13.33.88.124, 13.33.88.33, 13.33.88.53
Response IP 13.33.88.124
Found Yes
Hash 9e62882a29144e3bf2669b101e41dd92fa5f72f6a0292396d8b599c6b118ea97
SimHash 4839f261ff13

Groups

*

Rule Path
Allow /api/utilities/v1/imageproxy
Disallow /maestro/api/
Disallow /api/
Disallow /maestro/
Disallow /ui/
Disallow /signature/voucher/
Disallow /account/
Disallow /acclaimbadge/
Disallow /voucher/
Disallow /search
Disallow /learn-perf/
Disallow /specializations-perf/
Disallow /professional-certificates-perf/
Disallow /learn-noperf/
Disallow /specializations-noperf/
Disallow /professional-certificates-noperf/
Disallow /career-academy/programs/
Disallow /business/xmlrpc.php
Disallow /business/wp-content/uploads/
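
The "*" group above pairs one Allow rule with a broader Disallow on /api/. A minimal sketch of how a crawler might resolve that overlap, assuming the longest-match precedence described in RFC 9309 (the most specific matching prefix wins, and Allow wins a tie). The names RULES and is_allowed are hypothetical, only a subset of the listed rules is included, and wildcard characters are not handled.

```python
# Subset of the "*" group from the report; names here are illustrative only.
RULES = [
    ("allow", "/api/utilities/v1/imageproxy"),
    ("disallow", "/maestro/api/"),
    ("disallow", "/api/"),
    ("disallow", "/search"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-prefix matching.

    Assumption: plain prefix rules only (no '*' or '$' handling).
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for path in ("/api/utilities/v1/imageproxy/img.png", "/api/catalog", "/learn/python"):
        print(path, is_allowed(path))
```

Under this reading the image proxy stays reachable even though the rest of /api/ is blocked, which appears to be the intent of listing the Allow rule alongside the broader Disallow.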

ccbot

Rule Path
Disallow /lecture/

gptbot

Rule Path
Disallow /lecture/

linkedinbot

Rule Path
Allow /account/accomplishments/

twitterbot

Rule Path
Allow /account/accomplishments/
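
The groups above can be checked with Python's standard urllib.robotparser. This is a sketch under stated assumptions: ROBOTS_TXT is a shortened reconstruction from the report, not the file as served, and "examplebot" is a hypothetical crawler name standing in for any agent without its own group. Note that urllib.robotparser applies rules in file order (first match), and that a crawler with its own group is evaluated against that group only, not against "*".

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of the reported groups (assumption: actual file
# layout and ordering may differ from this sketch).
ROBOTS_TXT = """\
User-agent: *
Allow: /api/utilities/v1/imageproxy
Disallow: /api/
Disallow: /account/
Disallow: /search

User-agent: gptbot
Disallow: /lecture/

User-agent: linkedinbot
Allow: /account/accomplishments/
"""

BASE = "https://www.coursera.org"


def build_parser(text=ROBOTS_TXT):
    """Parse robots.txt text held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser()
    for agent, path in [
        ("examplebot", "/account/profile"),
        ("gptbot", "/lecture/intro"),
        ("gptbot", "/account/profile"),
        ("linkedinbot", "/account/accomplishments/certificate/X"),
    ]:
        print(agent, path, parser.can_fetch(agent, BASE + path))
```

One consequence worth noting: because gptbot and ccbot have their own groups, the "*" Disallow rules do not apply to them under this interpretation; only /lecture/ is blocked for those agents.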

Other Records

Field Value
sitemap https://www.coursera.org/sitemap.xml
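
The sitemap record above is not tied to any user-agent group. A small sketch of reading it with urllib.robotparser, assuming Python 3.8 or later (where site_maps() is available); the helper name sitemap_urls and the sample text are illustrative, not taken from the served file.

```python
from urllib.robotparser import RobotFileParser


def sitemap_urls(robots_text):
    """Return the Sitemap URLs declared in robots.txt text (empty list if none)."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    # site_maps() returns None when no Sitemap line is present.
    return parser.site_maps() or []


if __name__ == "__main__":
    sample = (
        "User-agent: *\n"
        "Disallow: /api/\n"
        "\n"
        "Sitemap: https://www.coursera.org/sitemap.xml\n"
    )
    print(sitemap_urls(sample))
```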