mindat.org
robots.txt

Robots Exclusion Standard data for mindat.org

Resource Scan

Scan Details

Site Domain mindat.org
Base Domain mindat.org
Scan Status Ok
Last Scan2024-09-20T17:11:35+00:00
Next Scan 2024-09-27T17:11:35+00:00

Last Scan

Scanned2024-09-20T17:11:35+00:00
URL https://mindat.org/robots.txt
Redirect https://www.mindat.org/robots.txt
Redirect Domain www.mindat.org
Redirect Base mindat.org
Domain IPs 104.26.12.92, 104.26.13.92, 172.67.74.189, 2606:4700:20::681a:c5c, 2606:4700:20::681a:d5c, 2606:4700:20::ac43:4abd
Redirect IPs 104.26.12.92, 104.26.13.92, 172.67.74.189, 2606:4700:20::681a:c5c, 2606:4700:20::681a:d5c, 2606:4700:20::ac43:4abd
Response IP 104.26.13.92
Found Yes
Hash f46ad901b1a8c1d168603c31646aa4a37d30871f66d470679a79b866eca38e9d
SimHash 0305dc6362b1

Groups

*

Rule Path
Disallow /links.php
Disallow /dirforward.php
Disallow /bannerclick.php
Disallow /photolocstats.php
Disallow /mineraledit.php
Disallow /minid_merge.php
Disallow /minclick.php
Disallow /click.php
Disallow /bfood-start.html
Disallow /newgalleryp.php
Disallow /gallery.php
Disallow /user-2022.html
Disallow /minid-merge.php
Disallow /paleoimg.php
Disallow /live_upd.php
Disallow /wiki_load.php
Disallow /gbif_thumbs/

amazonbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

gogator

Rule Path
Disallow /

nutch

Rule Path
Disallow /

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10