essexcricket.com
robots.txt

Robots Exclusion Standard data for essexcricket.com

Resource Scan

Scan Details

Site Domain essexcricket.com
Base Domain essexcricket.com
Scan Status Ok
Last Scan 2024-09-25T03:27:41+00:00
Next Scan 2024-10-25T03:27:41+00:00

Last Scan

Scanned 2024-09-25T03:27:41+00:00
URL https://www.essexcricket.com/robots.txt
Domain IPs 94.102.155.202
Response IP 94.102.155.202
Found Yes
Hash a77594d6fc383b29930c70d52c3f176ab0045a50da49fca981e2f5a50fa8cc98
SimHash 125674726195

Groups

*

Rule Path
Disallow /*?
Disallow /admin/
Disallow /static/
Disallow /images/
Disallow /css/
Disallow /services/
Disallow /pleasedontindex.htm
Disallow /nobots.html
Disallow /*.axd$
Disallow /*.ashx$
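
Several of the paths above use the wildcard syntax, where `*` stands for any run of characters and a trailing `$` anchors the end of the URL. A minimal sketch of how such patterns could be evaluated is shown here. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and the sketch ignores Allow rules and longest-match precedence, so it only illustrates the matching itself:

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/*?", "/admin/", "/static/", "/images/", "/css/", "/services/",
    "/pleasedontindex.htm", "/nobots.html", "/*.axd$", "/*.ashx$",
]

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (illustrative helper).

    '*' matches any run of characters, a trailing '$' anchors the end,
    and everything else is a literal that matches as a prefix.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))

def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)

if __name__ == "__main__":
    # Example paths are made up for illustration.
    for path in ["/news?page=2", "/admin/login", "/fixtures/", "/WebResource.axd"]:
        print(path, is_disallowed(path))
```

Under this reading, `/*?` blocks any URL that carries a query string, while `/*.axd$` and `/*.ashx$` block only URLs that end in those extensions.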

Other Records

Field Value
crawl-delay 30
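
The crawl-delay field is not part of the core Robots Exclusion Protocol, and crawlers differ in whether and how they honor it. A common reading is "wait this many seconds between requests". The sketch below assumes that reading; the function name `seconds_to_wait` is hypothetical:

```python
CRAWL_DELAY_SECONDS = 30  # value from the crawl-delay record above

def seconds_to_wait(last_fetch: float, now: float,
                    delay: float = CRAWL_DELAY_SECONDS) -> float:
    """Return how long to pause before the next request.

    Assumes crawl-delay means a minimum gap, in seconds, between
    consecutive requests to the same host. Timestamps are in seconds.
    """
    return max(0.0, last_fetch + delay - now)

if __name__ == "__main__":
    # Last request at t=100 s, current time t=110 s.
    print(seconds_to_wait(100.0, 110.0))
```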

wbsearchbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

ahrefs.com/robot/

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

buck

Rule Path
Disallow /

googlebot

Rule Path
Allow /events/default.aspx?ical=true*

testignorebot

No rules defined. All paths allowed.
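
A crawler normally obeys only one group: the one whose user-agent name matches its own product token, with `*` as the fallback. The sketch below illustrates that selection for the groups listed in this scan. It assumes a simple case-insensitive exact match, which is a simplification of what real crawlers do, and `select_group` is a hypothetical helper name:

```python
# Group names as they appear in this scan report.
GROUPS = {
    "*", "wbsearchbot", "baiduspider", "yandex", "ezooms",
    "ahrefs.com/robot/", "shopwiki", "turnitinbot", "searchmetricsbot",
    "semrushbot", "the knowledge ai", "seekport crawler", "mj12bot",
    "dataforseobot", "serpstatbot", "panscient.com", "buck",
    "googlebot", "testignorebot",
}

def select_group(product_token: str, groups=GROUPS) -> str:
    """Pick the group a crawler would follow (simplified exact match)."""
    token = product_token.lower()
    return token if token in groups else "*"

if __name__ == "__main__":
    for bot in ["Googlebot", "SemrushBot", "bingbot"]:
        print(bot, "->", select_group(bot))
```

One consequence of this rule is that a crawler matching the googlebot group follows only the Allow line in that group, so the Disallow rules and crawl-delay in the `*` group would not apply to it.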