careerindia.com
robots.txt

Robots Exclusion Standard data for careerindia.com

Resource Scan

Scan Details

Site Domain careerindia.com
Base Domain careerindia.com
Scan Status Ok
Last Scan 2024-06-07T05:10:22+00:00
Next Scan 2024-06-14T05:10:22+00:00

Last Scan

Scanned 2024-06-07T05:10:22+00:00
URL https://careerindia.com/robots.txt
Redirect https://www.careerindia.com/robots.txt
Redirect Domain www.careerindia.com
Redirect Base careerindia.com
Domain IPs 104.18.26.221, 104.18.27.221, 2606:4700::6812:1add, 2606:4700::6812:1bdd
Redirect IPs 104.18.26.221, 104.18.27.221, 2606:4700::6812:1add, 2606:4700::6812:1bdd
Response IP 104.18.26.221
Found Yes
Hash bdb82484c82e9435829c649da319aa8f049a29d57425921bdf3dcc51bbda9821
SimHash eb0f57734991

Groups

*

Rule Path
Allow /
Disallow /wire/
Disallow /temp/
Disallow /amphtml/temp/
Disallow /counselling/includes/
Disallow /results-colleges.html*
Disallow /answer-key/
Disallow /scripts/
Disallow /colleges/engineering/
Disallow /colleges/law/
Disallow /colleges/medical/
Disallow /colleges/mba/
Disallow /colleges/dental/
Disallow /hyderabad/
Disallow /pune/
Disallow /chennai/
Disallow /coimbatore/
Disallow /kolkata/
Disallow /bangalore/
Disallow /mumbai/
Disallow /ahemdabad/
Disallow /*?utm*
Disallow /*?ref*
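
The group above mixes a blanket "Allow /" with many narrower Disallow rules, so which rule wins depends on how the crawler resolves conflicts. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie), with "*" treated as a wildcard and a trailing "$" as an end anchor. The names pattern_to_regex, is_allowed, and RULES are hypothetical, and RULES holds only a subset of the rules listed above. Real crawlers may resolve conflicts differently.

```python
import re

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for wildcards.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path):
    """Return True if `path` may be fetched under `rules`.

    `rules` is a list of (kind, pattern) pairs, kind being 'allow' or
    'disallow'. Assumed precedence: longest matching pattern wins,
    and 'allow' wins when two matching patterns have equal length.
    """
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = (kind == "allow")
    return allowed

# A subset of the '*' group listed above (illustrative, not the full list).
RULES = [
    ("allow", "/"),
    ("disallow", "/wire/"),
    ("disallow", "/colleges/mba/"),
    ("disallow", "/*?utm*"),
]
```

Under these assumptions, is_allowed(RULES, "/news/") is True because only "Allow /" matches, while "/wire/story.html" and "/news/?utm_source=x" are refused because a longer Disallow pattern also matches.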

googlebot-news

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

grapeshot

Rule Path
Disallow
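
A Disallow rule with an empty path, as in the grapeshot group above, blocks nothing, so that agent may fetch every URL. As a quick check, the sketch below feeds an equivalent two-line record to Python's standard urllib.robotparser. The record text is reconstructed from the table above, not copied from the live file, and the test URL is only an example.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed record for the grapeshot group (assumed raw form).
lines = [
    "User-agent: grapeshot",
    "Disallow:",
]

parser = RobotFileParser()
parser.parse(lines)

# An empty Disallow path is treated as "allow everything" for this agent.
grapeshot_ok = parser.can_fetch("grapeshot", "https://www.careerindia.com/wire/")
```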

Other Records

Field Value
sitemap https://www.careerindia.com/sitemap_news.xml
sitemap https://www.careerindia.com/sitemap-latest.xml
sitemap https://www.careerindia.com/sitemap_index.xml
sitemap https://www.careerindia.com/sitemap-webstories.xml