jamboreeindia.com
robots.txt

Robots Exclusion Standard data for jamboreeindia.com

Resource Scan

Scan Details

Site Domain jamboreeindia.com
Base Domain jamboreeindia.com
Scan Status Ok
Last Scan2024-05-15T10:21:23+00:00
Next Scan 2024-06-14T10:21:23+00:00

Last Scan

Scanned2024-05-15T10:21:23+00:00
URL https://jamboreeindia.com/robots.txt
Redirect https://www.jamboreeindia.com/robots.txt
Redirect Domain www.jamboreeindia.com
Redirect Base jamboreeindia.com
Domain IPs 182.18.130.192
Redirect IPs 182.18.130.192
Response IP 182.18.130.192
Found Yes
Hash 176ec700fe77ab27366131cbc482ef3680d4ba50a0c535a07d87160c9d84cf52
SimHash 254c556057d4

Groups

*

Rule Path
Disallow /admin
Disallow /backup
Disallow /assets
Disallow /user_guide
Disallow /system
Disallow /home.git
Disallow /LP/
Disallow /LP
Disallow /Landing-Pages-new
Disallow /Form
Disallow /NEWLP
Disallow /test-prep
Disallow /Landing-Pages
Disallow /new-landing-pages
Disallow /gre-lp/
Disallow /gmat-lp/
Disallow /sat-lp/
Disallow /toefl-jamboree/
Disallow /ielts-jamboree/
Disallow /new-landing-pages/
Disallow /new-pages/
Disallow /newpages/
Disallow /sat-live/
Disallow /offline/
Disallow /gmat-live/
Disallow /gmatonline/
Disallow /satlive/
Disallow /liveonline/
Disallow /backup
Disallow /cc_av
Disallow /online-jamboree
Disallow /emailer-product
Disallow /scholarship-test
Disallow /before-live
Disallow /study-in/
Disallow /responsibility-policy
Disallow /student-resources
Disallow /student-resources-access
Disallow /?
Disallow /tag/
Disallow /wp-content/

ninjabot
bingbot
googlebot
msnbot
slurp
duckduckbot
baiduspider
yandexbot
ia_archiver
alexabot
mediapartners-google

Rule Path
Allow /

httrack

Rule Path
Disallow /
Disallow /search
Disallow /404
Disallow /search/
Disallow /blog/amp/*
Disallow /blog/amp/
Disallow /blog/amp
Disallow /know-how/amp/*
Disallow /know-how/amp/
Disallow /know-how/amp

Other Records

Field Value
sitemap https://www.jamboreeindia.com/sitemap.xml