pt.coursera.org
robots.txt

Robots Exclusion Standard data for pt.coursera.org

Resource Scan

Scan Details

Site Domain pt.coursera.org
Base Domain coursera.org
Scan Status Ok
Last Scan2024-05-06T10:23:17+00:00
Next Scan 2024-05-20T10:23:17+00:00

Last Scan

Scanned2024-05-06T10:23:17+00:00
URL https://pt.coursera.org/robots.txt
Redirect https://www.coursera.org/robots.txt
Redirect Domain www.coursera.org
Redirect Base coursera.org
Domain IPs 13.33.88.119, 13.33.88.124, 13.33.88.33, 13.33.88.53
Redirect IPs 13.33.88.119, 13.33.88.124, 13.33.88.33, 13.33.88.53
Response IP 13.33.88.119
Found Yes
Hash 210c204e41657849d0a47c1a8ea2ac46f1d462f1265bfb73ccfc4d56ba46d147
SimHash 4919ff65b753

Groups

*

Rule Path
Allow /api/utilities/v1/imageproxy
Disallow /maestro/api/
Disallow /api/
Disallow /maestro/
Disallow /ui/
Disallow /signature/voucher/
Disallow /account/email_verify/
Disallow /acclaimbadge/
Disallow /voucher/
Disallow /search
Disallow /lecture/
Disallow /learn-perf/
Disallow /specializations-perf/
Disallow /professional-certificates-perf/
Disallow /learn-noperf/
Disallow /specializations-noperf/
Disallow /professional-certificates-noperf/
Disallow /career-academy/programs/
Disallow /business/xmlrpc.php
Disallow /business/wp-content/uploads/

ccbot

Rule Path
Disallow /lecture/

gptbot

Rule Path
Disallow /lecture/

Other Records

Field Value
sitemap https://www.coursera.org/sitemap.xml