petsitllc.com
robots.txt

Robots Exclusion Standard data for petsitllc.com

Resource Scan

Scan Details

Site Domain petsitllc.com
Base Domain petsitllc.com
Scan Status Ok
Last Scan2024-09-24T02:15:23+00:00
Next Scan 2024-10-24T02:15:23+00:00

Last Scan

Scanned2024-09-24T02:15:23+00:00
URL https://petsitllc.com/robots.txt
Redirect https://www.petsitllc.com/robots.txt
Redirect Domain www.petsitllc.com
Redirect Base petsitllc.com
Domain IPs 192.124.249.158
Redirect IPs 192.124.249.158
Response IP 192.124.249.158
Found Yes
Hash a8a62a3f0c35322ff4ca04e35691d94cc1eb3980cd64ccecdf51bf39f8966318
SimHash 091d9917f351

Groups

ninjabot

Rule Path
Allow /

*

Rule Path
Disallow /admin/
Disallow /members/
Disallow /process/
Disallow /documents/
Disallow /newsletter/
Disallow /library/
Disallow /emails/
Disallow /webfiles/

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

googlebot-image

Rule Path
Disallow /webfiles/*

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10