usc.edu.co
robots.txt
Robots Exclusion Standard data for usc.edu.co
Resource Scan
Scan Details
Site Domain | usc.edu.co |
Base Domain | usc.edu.co |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-08-21T04:27:24+00:00 |
Next Scan | 2024-10-20T04:27:24+00:00 |
Last Successful Scan
Scanned | 2024-05-31T04:25:40+00:00 |
URL | https://www.usc.edu.co/robots.txt |
Domain IPs | 18.208.68.235, 54.152.70.178 |
Response IP | 18.208.68.235 |
Found | Yes |
Hash | 96cdd1c0fcd3927d5ca9ae372a04ce34f73caca96f4e045755b61d4c0fb49809 |
SimHash | ea019800cbb2 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
*
Rule | Path |
---|---|
Disallow | /wp-content/uploads/wpo-plugins-tables-list.json |
Other Records
Field | Value |
---|---|
sitemap | https://www.usc.edu.co/wp-sitemap.xml |