estatepolis.com
robots.txt

Robots Exclusion Standard data for estatepolis.com

Resource Scan

Scan Details

Site Domain estatepolis.com
Base Domain estatepolis.com
Scan Status Ok
Last Scan2024-06-14T06:41:24+00:00
Next Scan 2024-07-14T06:41:24+00:00

Last Scan

Scanned2024-06-14T06:41:24+00:00
URL https://estatepolis.com/robots.txt
Domain IPs 104.21.74.211, 172.67.162.233, 2606:4700:3033::6815:4ad3, 2606:4700:3033::ac43:a2e9
Response IP 104.21.74.211
Found Yes
Hash 801d1070103eddff47c4a18d328b49c88dd9d143994ac289bb6d54172de7dd5f
SimHash 0f0a59560577

Groups

*

Rule Path
Allow /
Disallow /plugins/
Disallow /libs/
Disallow /includes/
Disallow /print*
Disallow /*?sort_by=
Disallow /*%26sort_by%3D
Disallow /*?sort_type=
Disallow /*%26sort_type%3D
Disallow /*confirm.html*
Disallow /*listing-details.html*
Disallow /*print.html*
Disallow /404.html*
Disallow /*listing-remove.html*
Disallow /*newsletter.html*
Disallow /*pdf-export.html*

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.estatepolis.com/sitemap.xml

Comments

  • robots.txt
  • Rules generated by the Sitemap plugin
  • Excluded pages:

Warnings

  • `host` is not a known field.