greatlearning.in
robots.txt

Robots Exclusion Standard data for greatlearning.in

Resource Scan

Scan Details

Site Domain greatlearning.in
Base Domain greatlearning.in
Scan Status Ok
Last Scan 2024-11-04T13:31:52+00:00
Next Scan 2024-11-18T13:31:52+00:00

Last Scan

Scanned 2024-11-04T13:31:52+00:00
URL https://greatlearning.in/robots.txt
Domain IPs 108.157.254.106, 108.157.254.57, 108.157.254.72, 108.157.254.81
Response IP 108.157.254.81
Found Yes
Hash b8256711b777855843325636285885d06433b507d48ad44e02a9d7f2d7569b3b
SimHash 08555a058013

Groups

*

Rule Path
Disallow /admin
Disallow /similar-profiles
Disallow /profiles
Disallow /w3migration
Disallow /migration
Disallow /pdf/*
Disallow /*.pdf$
Disallow *?enc_e_lid=*
Disallow *?enc_e_aid_p=*
Disallow /*%3D%3D$
Disallow /academy/enterprise/*
Disallow /academy/university/*
Disallow /academy/career/wp-content/*
Disallow /academy/search*
Disallow /api/v1/hydrate-user
Disallow /fsl/*/search*
Disallow /blog/*?filter_by=
Disallow /blog/?s=
Disallow /blog/*?catDropdown
Disallow /blog/*?fbclid
Disallow /blog/*?highlight
Disallow /blog/*?marketing_com
Disallow /blog/*?nonamp
Disallow /blog/*?nowprocket
Disallow /blog/*?type
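
Several of the rules above rely on the two special characters commonly supported in robots.txt paths: `*` (any run of characters) and a trailing `$` (end of URL). The following is a minimal Python sketch of how a crawler might evaluate such rules, not the matcher used by any particular crawler. The names `pattern_to_regex`, `is_disallowed`, and `DISALLOW` are hypothetical, the rule list is only a subset of the table above, and the sample paths are made up for illustration. Because this group contains no Allow rules, the sketch skips longest-match precedence.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters, a trailing '$' anchors the end
    of the URL, and everything else is treated as literal text.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

# Subset of the Disallow rules listed for the '*' group (illustrative).
DISALLOW = [
    "/admin",
    "/pdf/*",
    "/*.pdf$",
    "*?enc_e_lid=*",
    "/academy/search*",
    "/blog/*?filter_by=",
]

def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)

# Hypothetical paths, checked against the subset above.
for sample in ["/admin/login", "/courses/guide.pdf",
               "/courses/guide.pdf?x=1", "/blog/python?filter_by=new"]:
    print(sample, is_disallowed(sample))
```

Rules without `$` behave as prefixes, so `/admin` also covers `/admin/login`, while `/*.pdf$` only applies when the URL actually ends in `.pdf`.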

Other Records

Field Value
sitemap https://www.greatlearning.in/sitemap.xml