glusea.com
robots.txt
Robots Exclusion Standard data for glusea.com
Resource Scan
Scan Details
Site Domain | glusea.com |
Base Domain | glusea.com |
Scan Status | Ok |
Last Scan | 2024-10-07T14:33:27+00:00 |
Next Scan | 2024-10-14T14:33:27+00:00 |
Last Scan
Scanned | 2024-10-07T14:33:27+00:00 |
URL | https://glusea.com/robots.txt |
Domain IPs | 104.21.44.239, 172.67.205.67, 2606:4700:3035::6815:2cef, 2606:4700:3036::ac43:cd43 |
Response IP | 172.67.205.67 |
Found | Yes |
Hash | dad8ecf355db5d63fa6f01fe137e461187ef4f5e25960e97420915253d16a60d |
SimHash | 61055870e190 |
Groups
*
Rule | Path |
---|---|
Disallow | /phpmyadmin/ |
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Allow | /wp-includes/js/ |
Allow | /wp-includes/images/ |
Disallow | /hotlinks/ |
Disallow | /dev/ |
Disallow | /dl/ |
Disallow | /out.php |
Disallow | /fb_comment_id%3D* |
Other Records
Field | Value |
---|---|
sitemap | https://www.glusea.com/sitemap_index.xml |
Warnings
- `https` is not a known field.