gidc.in
robots.txt

Robots Exclusion Standard data for gidc.in

Resource Scan

Scan Details

Site Domain gidc.in
Base Domain gidc.in
Scan Status Ok
Last Scan2025-05-11T09:56:04+00:00
Next Scan 2025-06-10T09:56:04+00:00

Last Scan

Scanned2025-05-11T09:56:04+00:00
URL http://gidc.in/robots.txt
Domain IPs 103.233.77.51
Response IP 103.233.77.51
Found Yes
Hash 4522ba880ed93b1b40594d4f4a63a25b7f5e813b74fb389c980fe865fd6e3ab6
SimHash 2b2d7d81c412

Groups

*

Rule Path
Disallow /cache/
Disallow /cp/
Disallow /docs/
Disallow /files/
Disallow /includes/
Disallow /members/
Disallow /modules/
Disallow /template/
Disallow /cron.php
Disallow /maintenance.php
Disallow /ajax.php
Disallow /error.html
Disallow /*/send-message.html
Disallow /*/send-message-friend.html
Disallow /*/add-review.html
Disallow /*/suggestion.html
Disallow /*/claim.html
Disallow /out-*.html

Other Records

Field Value
sitemap http://www.gidc.in/sitemap.xml