cruiserlog.com
robots.txt

Robots Exclusion Standard data for cruiserlog.com

Resource Scan

Scan Details

Site Domain cruiserlog.com
Base Domain cruiserlog.com
Scan Status Ok
Last Scan2024-11-02T11:20:13+00:00
Next Scan 2024-11-09T11:20:13+00:00

Last Scan

Scanned2024-11-02T11:20:13+00:00
URL https://cruiserlog.com/robots.txt
Redirect https://www.cruiserlog.com/robots.txt
Redirect Domain www.cruiserlog.com
Redirect Base cruiserlog.com
Domain IPs 104.25.178.20, 104.25.179.20, 172.67.82.167
Redirect IPs 104.25.178.20, 104.25.179.20, 172.67.82.167
Response IP 172.67.82.167
Found Yes
Hash 14b6dc9c0c4c44435fca4a31b605a4c1f284324e42799e41d41819892c93826d
SimHash 4bbd9064a713

Groups

*

Rule Path
Disallow /forums/clientscript/
Disallow /forums/includes/
Disallow /forums/install/
Disallow /forums/customavatars/
Disallow /forums/signatureuploads/

*

Rule Path
Disallow /forums/printthread.php
Disallow /forums/cron.php

googlebot

Rule Path
Disallow /forums/external-link/

*

Rule Path
Disallow /ad_tags/

boardtracker

Rule Path
Disallow /

boardreader

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

grapeshot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ia_archiver

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

amazonbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

the knowledge ai

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

Other Records

Field Value
sitemap https://www.cruiserlog.com/forums/sitemap_index.xml.gz

Comments

  • Forum Folders
  • Forum Files
  • Outbound Link Exclude
  • Backup Ad Tags