thegreatcoursesplus.com
robots.txt

Robots Exclusion Standard data for thegreatcoursesplus.com

Resource Scan

Scan Details

Site Domain thegreatcoursesplus.com
Base Domain thegreatcoursesplus.com
Scan Status Ok
Last Scan 2024-10-02T07:12:11+00:00
Next Scan 2024-10-16T07:12:11+00:00

Last Scan

Scanned 2024-10-02T07:12:11+00:00
URL https://thegreatcoursesplus.com/robots.txt
Redirect https://www.thegreatcoursesplus.com:443/robots.txt
Redirect Domain www.thegreatcoursesplus.com
Redirect Base thegreatcoursesplus.com
Domain IPs 34.196.164.190, 54.167.77.248
Redirect IPs 23.210.99.38
Response IP 104.103.150.253
Found Yes
Hash d63e3d17932c0f24cac93c6b3dbb419985b5334daefffc134bad0d050561aa05
SimHash 43279583d392

Groups

*

Rule Path
Disallow /index.php/
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/*
Disallow /sendfriend/
Disallow /review/
Disallow /lp/*
Disallow /special-offer
Disallow /so-free-month
Disallow /3-mo-plan-2019
Disallow /long-landing-page
Disallow /short-landing-page
Disallow /catalogsearch*
Disallow /bvstate*
Disallow /welcome
Disallow /welcome-mobile
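
To make the effect of these rules concrete, here is a minimal Python sketch that checks a URL path against the Disallow patterns listed above. It assumes the matching conventions of the Robots Exclusion Protocol (prefix matching, `*` as a wildcard for any run of characters, and a trailing `$` anchoring the end of the path). The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, chosen for illustration only, and the sketch does not model Allow rules or longest-match precedence because this group contains only Disallow rules.

```python
import re

# Disallow patterns copied from the "*" group in the scan above.
DISALLOW = [
    "/index.php/", "/checkout/", "/app/", "/lib/", "/*.php$",
    "/pkginfo/", "/report/", "/var/", "/catalog/", "/customer/*",
    "/sendfriend/", "/review/", "/lp/*", "/special-offer",
    "/so-free-month", "/3-mo-plan-2019", "/long-landing-page",
    "/short-landing-page", "/catalogsearch*", "/bvstate*",
    "/welcome", "/welcome-mobile",
]


def pattern_to_regex(pattern):
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters, and a
    trailing '$' anchors the pattern to the end of the path.
    Without '$', the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path):
    """Return True if any Disallow pattern matches the given URL path."""
    return any(rule.match(path) for rule in RULES)


if __name__ == "__main__":
    for candidate in ["/checkout/cart", "/foo.php", "/foo.php?x=1", "/courses/history"]:
        print(candidate, is_disallowed(candidate))
```

Because unanchored patterns are prefix matches, a rule such as `/welcome` also covers `/welcome-mobile` and any other path that begins with the same characters, while `/*.php$` only matches paths that end in `.php`.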

Other Records

Field Value
sitemap https://www.thegreatcoursesplus.com/media/sitemap.xml

Warnings

  • 1 invalid line.