allen.ac.in
robots.txt

Robots Exclusion Standard data for allen.ac.in

Resource Scan

Scan Details

Site Domain allen.ac.in
Base Domain allen.ac.in
Scan Status Ok
Last Scan 2025-06-04T15:48:46+00:00
Next Scan 2025-07-04T15:48:46+00:00
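
The two timestamps above are ISO 8601 values with an explicit UTC offset, and they sit exactly 30 days apart. A minimal sketch of that arithmetic follows; the 30-day rescan interval is inferred from these two values only and is an assumption, not a documented property of the scanner.

```python
from datetime import datetime, timedelta

# Timestamps copied from the scan details above.
last_scan = datetime.fromisoformat("2025-06-04T15:48:46+00:00")
next_scan = datetime.fromisoformat("2025-07-04T15:48:46+00:00")

# Assumption: the scanner reschedules at a fixed interval.
# The interval is derived from the report, not taken from any documentation.
interval = next_scan - last_scan

def schedule_next(scanned_at: datetime, every: timedelta = interval) -> datetime:
    """Return the next scan time for a given scan time (hypothetical helper)."""
    return scanned_at + every

print(interval.days)               # gap between the two scans, in days
print(schedule_next(last_scan).isoformat())
```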

Last Scan

Scanned 2025-06-04T15:48:46+00:00
URL https://allen.ac.in/robots.txt
Redirect https://www.allen.ac.in/robots.txt
Redirect Domain www.allen.ac.in
Redirect Base allen.ac.in
Domain IPs 13.200.188.135
Redirect IPs 13.200.188.135
Response IP 13.200.188.135
Found Yes
Hash 6bca7a05f057a364537695505c67b43f00d9f8ff4fa9250f2237194bb4cbf646
SimHash 382fbdf5b3d7
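
The redirect and hash fields above can be checked with two small helpers. This is a sketch under stated assumptions: the 64-hex-digit Hash value is consistent with a SHA-256 digest, but the report does not name the algorithm or say which bytes were hashed, so `body_matches` only illustrates the comparison. The base domain is taken as given from the report; deriving it independently would need the Public Suffix List. Both function names are hypothetical.

```python
import hashlib

# Values copied from the scan details above.
BASE_DOMAIN = "allen.ac.in"
REDIRECT_DOMAIN = "www.allen.ac.in"
REPORTED_HASH = "6bca7a05f057a364537695505c67b43f00d9f8ff4fa9250f2237194bb4cbf646"

def same_base_domain(host: str, base: str) -> bool:
    """True if host is the base domain itself or a subdomain of it."""
    host = host.lower().rstrip(".")
    base = base.lower().rstrip(".")
    return host == base or host.endswith("." + base)

def body_matches(body: bytes, expected_hex: str) -> bool:
    """Compare a SHA-256 digest of the fetched body with a reported hash.

    Assumption: the report's Hash field is SHA-256 over the raw response body.
    """
    return hashlib.sha256(body).hexdigest() == expected_hex.lower()

# The redirect target stays inside the scanned site's base domain.
print(same_base_domain(REDIRECT_DOMAIN, BASE_DOMAIN))
```

To compare against a live copy, the bytes fetched from the Redirect URL would be passed to `body_matches` together with `REPORTED_HASH`; a mismatch may simply mean the file changed after the scan or that the hashing assumption is wrong.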

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /css/
Disallow /js/
Disallow /AWStats
Disallow /distance_learning-old
Disallow /victory-celebration/
Disallow /apps/session1314/intResults/M/CCP/
Disallow /apps/session1314/intResults/E/CCP/
Disallow /apps/session1314/intResults/T/CCP/
Disallow /apps/session1314/intResults/Pre-Nurture/
Disallow /apps/session1213/intResults/ALLENAIIMS/
Disallow /ahmedabad/tallentex/
Disallow /2013-14/
Disallow /2014-15/
Disallow /2015-16/
Disallow /2016-17/
Disallow /2017-18/
Disallow /2018-19/
Disallow /2019-20/
Disallow /2020-21/
Disallow /2021-22/
Disallow /2023-24/
Disallow /apps2425/
Disallow /apps2324/
Disallow /ace2324/
Disallow /imagegallery/
Allow /apps/selection-results/
Allow /images
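
The rule table above can be evaluated with the longest-match precedence described in RFC 9309: the matching rule with the longest path wins, and an Allow beats a Disallow of equal length. The sketch below is a minimal illustration for the `*` group only. It uses plain prefix matching because none of the listed paths contain `*` or `$`, and the `is_allowed` helper is hypothetical. Individual crawlers may resolve conflicts differently.

```python
# Rules for the "*" group, copied in order from the table above.
RULES = [
    ("disallow", "/cgi-bin/"),
    ("disallow", "/css/"),
    ("disallow", "/js/"),
    ("disallow", "/AWStats"),
    ("disallow", "/distance_learning-old"),
    ("disallow", "/victory-celebration/"),
    ("disallow", "/apps/session1314/intResults/M/CCP/"),
    ("disallow", "/apps/session1314/intResults/E/CCP/"),
    ("disallow", "/apps/session1314/intResults/T/CCP/"),
    ("disallow", "/apps/session1314/intResults/Pre-Nurture/"),
    ("disallow", "/apps/session1213/intResults/ALLENAIIMS/"),
    ("disallow", "/ahmedabad/tallentex/"),
    ("disallow", "/2013-14/"),
    ("disallow", "/2014-15/"),
    ("disallow", "/2015-16/"),
    ("disallow", "/2016-17/"),
    ("disallow", "/2017-18/"),
    ("disallow", "/2018-19/"),
    ("disallow", "/2019-20/"),
    ("disallow", "/2020-21/"),
    ("disallow", "/2021-22/"),
    ("disallow", "/2023-24/"),
    ("disallow", "/apps2425/"),
    ("disallow", "/apps2324/"),
    ("disallow", "/ace2324/"),
    ("disallow", "/imagegallery/"),
    ("allow", "/apps/selection-results/"),
    ("allow", "/images"),
]

def is_allowed(path: str, rules=RULES) -> bool:
    """Longest-match evaluation; paths with no matching rule are allowed."""
    best_len, verdict = -1, True
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = directive == "allow"
        # Longer prefix wins; on equal length, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len, verdict = len(prefix), allow
    return verdict

for p in ["/css/site.css", "/images/logo.png", "/imagegallery/a.jpg",
          "/apps/selection-results/2024", "/"]:
    print(p, is_allowed(p))
```

Because no Disallow rule covers `/apps/` or `/images` as a whole, the two Allow lines do not override anything in this group; they restate that those paths are crawlable.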

Other Records

Field Value
sitemap https://www.allen.ac.in/sitemap.xml
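
The groups, the sitemap record, and the comment shown in this report are all derived from the lines of a single robots.txt file. A minimal sketch of that extraction follows. The `SAMPLE` text is an abbreviated reconstruction from the report, not a copy of the live file, and `parse_robots` is a hypothetical helper that handles only the fields seen here.

```python
# Abbreviated reconstruction of the file (assumption: the real file's
# ordering and spacing may differ from this sample).
SAMPLE = """\
# allen.ac.in robot file
User-agent: *
Disallow: /cgi-bin/
Allow: /images
Sitemap: https://www.allen.ac.in/sitemap.xml
"""

def parse_robots(text: str):
    """Split robots.txt text into rule groups, sitemap URLs, and comments."""
    groups = {}       # user-agent -> list of (directive, path)
    sitemaps = []
    comments = []
    agents = []       # user-agents of the group currently being read
    in_rules = False  # True once the current group has at least one rule
    for raw in text.splitlines():
        line, _, comment = raw.partition("#")
        if comment.strip():
            comments.append(comment.strip())
        if ":" not in line:
            continue
        field, _, value = line.partition(":")  # split at the first colon only
        field, value = field.strip().lower(), value.strip()
        if field == "user-agent":
            if in_rules:  # a user-agent line after rules starts a new group
                agents, in_rules = [], False
            agents.append(value)
            groups.setdefault(value, [])
        elif field in ("allow", "disallow"):
            in_rules = True
            for agent in agents:
                groups[agent].append((field, value))
        elif field == "sitemap":
            sitemaps.append(value)  # sitemap records belong to no group
    return groups, sitemaps, comments

groups, sitemaps, comments = parse_robots(SAMPLE)
print(groups, sitemaps, comments)
```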

Comments

  • allen.ac.in robot file