zerca.com
robots.txt

Robots Exclusion Standard data for zerca.com

Resource Scan

Scan Details

Site Domain	zerca.com
Base Domain	zerca.com
Scan Status	Ok
Last Scan	2024-09-23T07:41:22+00:00
Next Scan	2024-10-23T07:41:22+00:00

Last Scan

Scanned	2024-09-23T07:41:22+00:00
URL	https://zerca.com/robots.txt
Redirect	https://www.zerca.com/robots.txt
Redirect Domain	www.zerca.com
Redirect Base	zerca.com
Domain IPs	213.4.63.50
Redirect IPs	213.4.63.50
Response IP	213.4.63.50
Found	Yes
Hash	620e7677e46559f2c76f07593122a7167b68bd672b8874b6eb52fd136857083b
SimHash	3a701f9eece8
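
The `Hash` value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The following is a minimal sketch, assuming (the report does not confirm this) that the hash is SHA-256 over the raw response body. The helper names `content_hash` and `unchanged_since_scan` are hypothetical and not part of the scanner.

```python
import hashlib

# Value copied from the "Hash" row of the scan above.
SCAN_HASH = "620e7677e46559f2c76f07593122a7167b68bd672b8874b6eb52fd136857083b"


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes of the response body.
    """
    return hashlib.sha256(body).hexdigest()


def unchanged_since_scan(body: bytes, recorded: str = SCAN_HASH) -> bool:
    """Compare a freshly fetched body against a recorded digest."""
    return content_hash(body) == recorded.lower()
```

Passing the bytes fetched from `https://www.zerca.com/robots.txt` to `unchanged_since_scan` would indicate whether the file still matches the recorded digest, provided the assumption about the hash function holds.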

Groups

*

Rule	Path
Disallow	/cart
Disallow	/checkout
Disallow	/my-account

Other Records

Field	Value	Comment
crawl-delay	10	10 seconds between page requests

cazoodlebot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

dotbot/1.0

Rule	Path
Disallow	/

gigabot

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

semrushbot-sa

Rule	Path
Disallow	/

sistrix

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

rogerbot

Rule	Path
Disallow	/

ia_archiver

Rule	Path
Disallow	/

seokicks-robot

Rule	Path
Disallow	/

searchmetricsbot

Rule	Path
Disallow	/

spbot

Rule	Path
Disallow	/

xovi

Rule	Path
Disallow	/

xovibot

Rule	Path
Disallow	/

screaming frog seo spider

Rule	Path
Disallow	/

seekport crawler

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	/sitemap.xml
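
To illustrate how a crawler might apply the groups listed above, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a partial reconstruction from the scan tables, not the verbatim file served by zerca.com. The capitalisation of the bot names, the placement of `Crawl-delay` inside the `*` group, and the sample path `/products` are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the scan tables; NOT the verbatim file.
# Assumption: Crawl-delay belongs to the "*" group.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cart
Disallow: /checkout
Disallow: /my-account
Crawl-delay: 10

User-agent: MJ12bot
Disallow: /

User-agent: AhrefsBot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://www.zerca.com" + path)


# A generic crawler falls back to the "*" group.
print(allowed("Googlebot", "/cart"))      # blocked by Disallow: /cart
print(allowed("Googlebot", "/products"))  # hypothetical path, not listed
# A bot with its own group is blocked everywhere.
print(allowed("MJ12bot", "/products"))
print(parser.crawl_delay("Googlebot"))
```

Note that `urllib.robotparser` uses simple prefix matching, so `Disallow: /cart` also covers any path that merely starts with `/cart`. Other parsers may resolve group precedence differently.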

Comments

For all robots
Block access to specific groups of pages
Allow search crawlers to discover the sitemap
Block CazoodleBot as it does not present correct accept content headers
Block MJ12bot as it is just noise
Block dotbot as it cannot parse base urls properly
Block Gigabot

Warnings

`request-rate` is not a known field.
`visit-time` is not a known field.
