rockcrawler.com
robots.txt

Robots Exclusion Standard data for rockcrawler.com

Resource Scan

Scan Details

Site Domain rockcrawler.com
Base Domain rockcrawler.com
Scan Status Ok
Last Scan2024-11-19T04:42:02+00:00
Next Scan 2024-11-26T04:42:02+00:00

Last Scan

Scanned2024-11-19T04:42:02+00:00
URL https://rockcrawler.com/robots.txt
Domain IPs 147.75.201.43
Response IP 147.75.201.43
Found Yes
Hash a186f5534b1d900eb8c3e7574b3e8399009b3e97f0143404ba47ecaa94429b65
SimHash 6a79c850c215

Groups

boardtracker

Rule Path
Disallow /

boardreader

Rule Path
Disallow /

topify

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

yandex

No rules defined. All paths allowed.

Other Records

Field Value Comment
crawl-delay 4 specifies a delay of x seconds

*

Rule Path
Disallow /readersrides/*
Disallow /landuse/admin/
Disallow /cgi-bin/
Disallow /wp-admin/

Other Records

Field Value
sitemap https://www.rockcrawler.com/sitemap_index.xml
sitemap https://www.rockcrawler.com/sitemap.xml.gz
sitemap https://www.rockcrawler.com/forum/sitemap.php
sitemap https://www.rockcrawler.com/sitemap_archive.xml
sitemap https://www.rockcrawler.com/sitemap_images.xml

Comments

  • global