gemiilan.com
robots.txt
Robots Exclusion Standard data for gemiilan.com
Resource Scan
Scan Details
Site Domain | gemiilan.com |
Base Domain | gemiilan.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2025-08-28T09:06:06+00:00 |
Next Scan | 2025-09-04T09:06:06+00:00 |
Last Successful Scan
Scanned | 2025-08-20T05:54:50+00:00 |
URL | https://gemiilan.com/robots.txt |
Domain IPs | 89.252.190.90 |
Response IP | 89.252.190.90 |
Found | Yes |
Hash | 987d653834078a5754fd792e5c2fe1e5905418ccee0a4edefb161a228b69d2c1 |
SimHash | 0f4b19160567 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /plugins/ |
Disallow | /libs/ |
Disallow | /includes/ |
Disallow | /print* |
Disallow | /*?sort_by= |
Disallow | /*%26sort_by%3D |
Disallow | /*?sort_type= |
Disallow | /*%26sort_type%3D |
Disallow | /*confirm.html* |
Disallow | /*listing-details.html* |
Disallow | /*print.html* |
Disallow | /404.html* |
Disallow | /*listing-remove.html* |
Disallow | /*newsletter.html* |
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
Other Records
Field | Value |
---|---|
sitemap | https://www.gemiilan.com/sitemap.xml |
Comments