gemiilan.com
robots.txt

Robots Exclusion Standard data for gemiilan.com

Resource Scan

Scan Details

Site Domain gemiilan.com
Base Domain gemiilan.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2025-08-28T09:06:06+00:00
Next Scan 2025-09-04T09:06:06+00:00

Last Successful Scan

Scanned2025-08-20T05:54:50+00:00
URL https://gemiilan.com/robots.txt
Domain IPs 89.252.190.90
Response IP 89.252.190.90
Found Yes
Hash 987d653834078a5754fd792e5c2fe1e5905418ccee0a4edefb161a228b69d2c1
SimHash 0f4b19160567

Groups

*

Rule Path
Allow /
Disallow /plugins/
Disallow /libs/
Disallow /includes/
Disallow /print*
Disallow /*?sort_by=
Disallow /*%26sort_by%3D
Disallow /*?sort_type=
Disallow /*%26sort_type%3D
Disallow /*confirm.html*
Disallow /*listing-details.html*
Disallow /*print.html*
Disallow /404.html*
Disallow /*listing-remove.html*
Disallow /*newsletter.html*

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.gemiilan.com/sitemap.xml

Comments

  • robots.txt
  • Rules generated by the Sitemap plugin
  • Excluded pages: