lignea24.com
robots.txt

Robots Exclusion Standard data for lignea24.com

Resource Scan

Scan Details

Site Domain lignea24.com
Base Domain lignea24.com
Scan Status Ok
Last Scan 2024-09-08T02:04:49+00:00
Next Scan 2024-10-08T02:04:49+00:00

Last Scan

Scanned 2024-09-08T02:04:49+00:00
URL https://lignea24.com/robots.txt
Domain IPs 2600:9000:269b:1c00:e:b58:81c0:93a1, 2600:9000:269b:3600:e:b58:81c0:93a1, 2600:9000:269b:5a00:e:b58:81c0:93a1, 2600:9000:269b:8a00:e:b58:81c0:93a1, 2600:9000:269b:ae00:e:b58:81c0:93a1, 2600:9000:269b:ba00:e:b58:81c0:93a1, 2600:9000:269b:d000:e:b58:81c0:93a1, 2600:9000:269b:ee00:e:b58:81c0:93a1, 3.160.196.104, 3.160.196.126, 3.160.196.60, 3.160.196.90
Response IP 65.9.112.46
Found Yes
Hash f6151d57f7b6ce05743d490a23b1b70950e2fcb94a972bf00f7f3474a403ae62
SimHash 6856579defe0
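
The scan does not say which algorithm produced the Hash value. Its 64 hexadecimal characters are consistent with a SHA-256 digest, so the sketch below assumes SHA-256 purely for illustration. It shows how a stored digest can be compared against a freshly fetched robots.txt body to detect changes between scans. The function names `body_digest` and `has_changed` are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt body.

    Assumption: SHA-256. The scan report does not name its algorithm.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Report whether the body differs from the previously stored digest."""
    return body_digest(body) != previous_digest
```

An exact digest changes on any byte difference, including whitespace, which is presumably why the report also lists a SimHash for approximate comparison.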

Groups

*

Rule Path
Disallow /de/cart
Disallow /de/checkout
Disallow /de/my-account
Disallow /de/wishlist

Other Records

Field Value Comment
crawl-delay 10 10 seconds between page requests

cazoodlebot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://lignea24.com/sitemap.xml
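
The groups and records above can be checked with Python's standard `urllib.robotparser`. The sketch below is a minimal illustration under stated assumptions: `ROBOTS_TXT` is reconstructed from the listed rules, not copied from the site, so its layout is a guess and it omits the original comments and the unknown `request-rate` and `visit-time` fields. `ExampleBot` is a hypothetical crawler name used to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan's groups; NOT the verbatim file from the site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /de/cart
Disallow: /de/checkout
Disallow: /de/my-account
Disallow: /de/wishlist
Crawl-delay: 10

User-agent: cazoodlebot
Disallow: /

User-agent: dotbot/1.0
Disallow: /

User-agent: gigabot
Disallow: /

Sitemap: https://lignea24.com/sitemap.xml
"""

BASE = "https://lignea24.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Check whether `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, BASE + path)


# Hypothetical crawler falling under the "*" group.
print(is_allowed("ExampleBot", "/de/cart"))   # blocked by the "*" group
print(is_allowed("ExampleBot", "/de/"))       # not covered by any Disallow
print(is_allowed("gigabot", "/"))             # blocked entirely
print(parser.crawl_delay("ExampleBot"))       # delay from the "*" group
print(parser.site_maps())                     # sitemap record
```

How a user-agent token with a version suffix such as `dotbot/1.0` is matched varies between parsers, so that group is left out of the checks here.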

Comments

  • For all robots
  • Block access to specific groups of pages
  • Allow search crawlers to discover the sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot

Warnings

  • `request-rate` is not a known field.
  • `visit-time` is not a known field.