apiit.edu.in
robots.txt

Robots Exclusion Standard data for apiit.edu.in

Archived Snapshots

Resource Scan

Scan Details

Site Domain	apiit.edu.in
Base Domain	apiit.edu.in
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-04-19T06:52:12+00:00
Next Scan	2025-07-18T06:52:12+00:00

Last Successful Scan

Scanned	2024-09-22T05:33:18+00:00
URL	https://apiit.edu.in/robots.txt
Domain IPs	66.103.203.194
Response IP	66.103.203.194
Found	Yes
Hash	e83379e873cdb5f969a2067850cabfebf8afcdd9bfaa6f4401efde9ded4c31bf
SimHash	a31f155843e5

Groups

*

Rule	Path
Disallow	/administrator/
Disallow	/cache/
Disallow	/cli/
Disallow	/components/
Disallow	/images/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/libraries/
Disallow	/logs/
Disallow	/media/
Disallow	/modules/
Disallow	/plugins/
Disallow	/templates/
Disallow	/tmp/

Rule

Path

Disallow

/administrator/

Disallow

/cache/

Disallow

/cli/

Disallow

/components/

Disallow

/images/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/libraries/

Disallow

/logs/

Disallow

/media/

Disallow

/modules/

Disallow

/plugins/

Disallow

/templates/

Disallow

/tmp/

Back to top

Comments

If the Joomla site is installed within a folder such as at
e.g. www.example.com/joomla/ the robots.txt file MUST be
moved to the site root at e.g. www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to the disallowed
path, e.g. the Disallow rule for the /administrator/ folder
MUST be changed to read Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
http://www.robotstxt.org/orig.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html

Back to top

apiit.edu.inrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Comments

apiit.edu.in
robots.txt