procherk.info
robots.txt
Robots Exclusion Standard data for procherk.info
Resource Scan
Scan Details
Site Domain | procherk.info |
Base Domain | procherk.info |
Scan Status | Ok |
Last Scan | 2024-11-09T19:02:39+00:00 |
Next Scan | 2024-11-16T19:02:39+00:00 |
Last Scan
Scanned | 2024-11-09T19:02:39+00:00 |
URL | https://procherk.info/robots.txt |
Domain IPs | 104.21.54.147, 172.67.139.104, 2606:4700:3030::ac43:8b68, 2606:4700:3037::6815:3693 |
Response IP | 172.67.139.104 |
Found | Yes |
Hash | 313ca43e8ad458e906789560a36b0e919aee895b0321dde16d277104e0ec8555 |
SimHash | aa60c3e04874 |
Groups
*
Rule | Path |
---|---|
Disallow | /index2.php? |
Disallow | *search |
Disallow | *newsfeeds |
Disallow | *wrapper |
Disallow | |
Allow | *printer |
Disallow | |
Disallow | *rss |
Disallow | *user |
Disallow | *com_mtree |
Disallow | *com_joomgallery |
Disallow | *com_eventlist |
Disallow | *com_jvlinx |
Disallow | /administrator/ |
Disallow | /cache/ |
Disallow | /component/ |
Disallow | /includes/ |
Disallow | /installation/ |
Disallow | /language/ |
Disallow | /libraries/ |
Disallow | /media/ |
Disallow | /tmp/ |
Disallow | /xmlrpc/ |