protorg.org
robots.txt

Robots Exclusion Standard data for protorg.org

Resource Scan

Scan Details

Site Domain protorg.org
Base Domain protorg.org
Scan Status Ok
Last Scan 2025-09-20T21:55:58+00:00
Next Scan 2025-09-27T21:55:58+00:00

Last Scan

Scanned 2025-09-20T21:55:58+00:00
URL https://protorg.org/robots.txt
Domain IPs 176.118.166.140
Response IP 176.118.166.140
Found Yes
Hash 44f203e4c409da9800070dc4f46961272d5203a4cc21a3837e7f07520c02de30
SimHash d995e823a600

Groups

*

Rule Path
Disallow /*?*
Disallow /*%26*
Disallow /_openstat
Disallow /t/*/g-*
Disallow /go/*
Disallow /goals/*
Disallow */search/
Disallow /statistic/*
Disallow /files/temp/
Disallow /reg/*
Disallow */g/*
Disallow /shopcart
Disallow /shopcart/*
Disallow */cart/*
Disallow */order/*
Disallow */checkout/*
Disallow /partners/*
Disallow /map-address/*
Disallow /data/*
Disallow /cds/*
Disallow /ajax/*
Disallow /privacy-policy/
Disallow */p/v*
Allow /inc/*
Allow /lego/*
Allow /pics/*
Allow /local_files/*
Allow /files/*
Allow /i/*
Allow *utm_*
Allow /frontend/dist/*
Allow /yml-export/*
Allow */p/*/?o=*
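
Several of the rules above overlap (for example `/files/temp/` is disallowed while `/files/*` is allowed), so the outcome depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-pattern-wins convention described in RFC 9309, with Allow winning ties; individual crawlers may resolve conflicts differently. The names `pattern_to_regex`, `is_allowed` and `RULES` are illustrative, and `RULES` holds only a subset of the group.

```python
import re

# A subset of the "*" group listed above, as (kind, pattern) pairs.
RULES = [
    ("disallow", "/*?*"),
    ("disallow", "*/search/"),
    ("disallow", "/files/temp/"),
    ("disallow", "/shopcart"),
    ("allow", "/files/*"),
    ("allow", "*utm_*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Return True if `path` may be fetched under `rules`.

    Assumption: the matching rule with the longest pattern wins,
    Allow wins a tie, and a path with no matching rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


for sample in ["/files/doc.pdf", "/files/temp/x.png", "/catalog/?utm_source=x"]:
    print(sample, is_allowed(sample, RULES))
```

Under these assumptions a query string is blocked by `/*?*` unless it also contains `utm_`, because the Allow pattern `*utm_*` is longer than `/*?*`.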

bubing

Rule Path
Disallow /

yadirectbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

yandexdirect

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

mail.ru

No rules defined. All paths allowed.
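
A crawler applies only one of the groups above: the one named for it, or `*` when none is. That means the `mail.ru` group, having no rules, does not inherit the `*` restrictions. The sketch below illustrates group selection under a stated assumption: the group name is compared case-insensitively as a substring of the crawler's product token, which is one common parser behaviour rather than a universal one. `select_group` and `GROUP_NAMES` are hypothetical names, and the user-agent strings in the usage lines are examples only.

```python
# Group names as listed in this report.
GROUP_NAMES = [
    "*",
    "bubing",
    "yadirectbot",
    "amazonbot",
    "linguee",
    "yandexdirect",
    "oai-searchbot",
    "mail.ru",
]


def select_group(user_agent: str, group_names: list[str]) -> str:
    """Pick the group a crawler should obey.

    `user_agent` is expected to be the product token, optionally followed
    by "/version" (not a full browser-style header). The longest group
    name contained in the token wins; otherwise the "*" group applies.
    """
    token = user_agent.split("/")[0].strip().lower()
    matches = [name for name in group_names
               if name != "*" and name.lower() in token]
    if matches:
        return max(matches, key=len)
    return "*"


print(select_group("Amazonbot/0.1", GROUP_NAMES))    # amazonbot
print(select_group("Mail.RU_Bot/2.0", GROUP_NAMES))  # mail.ru
print(select_group("Googlebot/2.1", GROUP_NAMES))    # *
```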

Other Records

Field Value
crawl-delay 10
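
`crawl-delay` is not part of RFC 9309 and some crawlers ignore it; those that honour it usually read the value as a minimum number of seconds between requests. A minimal sketch of that interpretation follows. The helper `seconds_to_wait` is hypothetical, and treating the value as seconds is an assumption.

```python
import time
from typing import Optional

CRAWL_DELAY_SECONDS = 10.0  # value reported above, assumed to be seconds


def seconds_to_wait(last_request_at: Optional[float], now: float,
                    delay: float = CRAWL_DELAY_SECONDS) -> float:
    """Return how long to pause before the next request.

    `last_request_at` and `now` are timestamps from the same monotonic
    clock; None means no request has been made yet.
    """
    if last_request_at is None:
        return 0.0
    return max(0.0, delay - (now - last_request_at))


# Usage: sleep for the remaining part of the delay before each fetch.
last = None
for _ in range(2):
    time.sleep(seconds_to_wait(last, time.monotonic(), delay=0.01))
    last = time.monotonic()
```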

Other Records

Field Value
sitemap https://protorg.org/sitemap.xml.gz
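
The sitemap is published as a gzip-compressed XML file. As a rough sketch, the bytes of such a file can be decompressed and its `<loc>` entries listed as shown below. This assumes the file follows the sitemaps.org 0.9 schema (either a `urlset` or a `sitemapindex`); `extract_locs` is a hypothetical helper, the sample document is made up for illustration, and downloading the file is left out.

```python
import gzip
import xml.etree.ElementTree as ET

# XML namespace used by the sitemaps.org 0.9 schema.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def extract_locs(gz_bytes: bytes) -> list[str]:
    """Decompress a gzipped sitemap and return every <loc> value in order."""
    root = ET.fromstring(gzip.decompress(gz_bytes))
    return [el.text.strip() for el in root.iter(SITEMAP_NS + "loc") if el.text]


# Made-up sample standing in for the downloaded file contents.
sample = gzip.compress(
    b'<?xml version="1.0" encoding="UTF-8"?>'
    b'<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    b"<url><loc>https://example.org/a</loc></url>"
    b"<url><loc>https://example.org/b</loc></url>"
    b"</urlset>"
)
print(extract_locs(sample))
```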