klaviano.com
robots.txt

Robots Exclusion Standard data for klaviano.com

Resource Scan

Scan Details

Site Domain klaviano.com
Base Domain klaviano.com
Scan Status Ok
Last Scan 2024-06-19T14:19:03+00:00
Next Scan 2024-06-26T14:19:03+00:00

Last Scan

Scanned 2024-06-19T14:19:03+00:00
URL https://klaviano.com/robots.txt
Domain IPs 172.66.40.151, 172.66.43.105, 2606:4700:3108::ac42:2897, 2606:4700:3108::ac42:2b69
Response IP 172.66.40.151
Found Yes
Hash a7d2b808efbd91676925ceefe430b1e9ecdeed34cf3ffea3415cdd13be26b0bf
SimHash 4b4399574f35
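
The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest. The scan does not state what is hashed, so the sketch below assumes it is the raw response body of robots.txt. The helper name `body_hash` is hypothetical.

```python
import hashlib

def body_hash(body: bytes) -> str:
    # Assumption: the digest covers the raw bytes of the robots.txt response,
    # with no normalization of line endings or whitespace.
    return hashlib.sha256(body).hexdigest()

def has_changed(previous_hash: str, body: bytes) -> bool:
    # A scanner could compare digests between scans to detect edits.
    return body_hash(body) != previous_hash
```

Under that assumption, a later scan whose body produces a different digest would indicate that the file was edited between scans.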

Groups

*

Rule Path
Allow /
Allow *.css
Allow *.js
Allow *.png
Allow *.jpg
Allow *.webp
Allow *.xml
Disallow /includes/
Disallow /print*
Disallow /*?sort_by=
Disallow /*%26sort_by%3D
Disallow /*?sort_type=
Disallow /*confirm.html*
Disallow /*listing-details.html*
Disallow /*print.html*
Disallow /404.html*
Disallow /*listing-remove.html*
Disallow /*comparision-tables.html*
Disallow /*booking-order.html*
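
Because this group mixes a blanket `Allow /` with wildcard rules, the outcome for a given URL depends on how a crawler resolves overlapping matches. The sketch below is one possible evaluation, assuming the longest-match rule described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). It uses a subset of the rules above. The function names and the sample paths are hypothetical, and real crawlers may treat patterns that lack a leading `/` (such as `*.css`) differently.

```python
import re

# A subset of the rules from the table above, in file order.
RULES = [
    ("allow", "/"),
    ("allow", "*.css"),
    ("disallow", "/includes/"),
    ("disallow", "/print*"),
    ("disallow", "/*?sort_by="),
    ("disallow", "/*listing-details.html*"),
]

def pattern_to_regex(pattern):
    # '*' matches any run of characters; a trailing '$' anchors the end.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    # Longest matching pattern wins; on equal length, Allow wins.
    # A path that matches no rule is allowed by default.
    best_len, best_verdict = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_verdict = len(pattern), allow
    return best_verdict
```

Under these assumptions, `/includes/style.css` would be blocked even though `*.css` is allowed, because `/includes/` is the longer pattern.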

Other Records

Field Value
crawl-delay 20
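
`crawl-delay` is not part of RFC 9309, and crawlers differ on whether they honor it. A minimal sketch of one common reading follows, assuming the value means a minimum number of seconds between consecutive requests to the site. The function name and the sample paths are hypothetical.

```python
def fetch_schedule(paths, crawl_delay=20.0, start=0.0):
    # Assumption: crawl-delay is the minimum gap, in seconds, between the
    # start of one request and the start of the next.
    # Returns (offset_in_seconds, path) pairs relative to `start`.
    return [(start + i * crawl_delay, path) for i, path in enumerate(paths)]
```

With a delay of 20 seconds, a polite crawler following this reading would fetch at most three pages per minute from the site.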

Other Records

Field Value
sitemap https://www.klaviano.com/sitemap.xml

Comments

  • robots.txt
  • Rules generated by the Sitemap plugin
  • Additional rules:
  • Excluded pages:

Warnings

  • `host` is not a known field.
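
A warning like this one can come from checking each `field: value` line against a list of recognized field names. The sketch below shows the idea. The set of known fields is an assumption about the scanner, and the sample text is illustrative only: the actual robots.txt is not reproduced in this report, and the `Host` value in particular is hypothetical.

```python
# Assumption: the fields this hypothetical checker recognizes.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}

def unknown_fields(text):
    # Collect field names that are not recognized, in order of first use.
    found = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found

# Illustrative sample, not the real file; the Host value is hypothetical.
SAMPLE = """\
# robots.txt
User-agent: *
Allow: /
Disallow: /includes/
Crawl-delay: 20
Host: klaviano.com
Sitemap: https://www.klaviano.com/sitemap.xml
"""

for name in unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```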