claw.ru
robots.txt

Robots Exclusion Standard data for claw.ru

Resource Scan

Scan Details

Site Domain claw.ru
Base Domain claw.ru
Scan Status Ok
Last Scan2024-07-02T23:35:36+00:00
Next Scan 2024-07-09T23:35:36+00:00

Last Scan

Scanned2024-07-02T23:35:36+00:00
URL https://claw.ru/robots.txt
Domain IPs 193.106.172.49
Response IP 193.106.172.49
Found Yes
Hash 5a4b15d6a45041ce8f07894cf741ea315ceccfd530f31a20dc984b030db1b04d
SimHash 0d45a351c7d3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /generator/
Disallow /generator1/
Disallow /generator2/
Disallow /adadmin/
Disallow /forms/

Other Records

Field Value
sitemap https://claw.ru/sitemap2.xml.gz
sitemap https://claw.ru/sitemap2.xml
sitemap https://claw.ru/sitemap.xml
sitemap https://claw.ru/book-readywork/sitemap0.xml
sitemap https://claw.ru/book-readywork/sitemap0.gz
sitemap https://claw.ru/book-readywork/sitemap1.xml
sitemap https://claw.ru/book-readywork/sitemap1.gz
sitemap https://claw.ru/referatti/sitemap3.xml.gz
sitemap https://claw.ru/referatti/sitemap3.xml
sitemap https://claw.ru/referatti/sitemap31.xml.gz
sitemap https://claw.ru/referatti/sitemap31.xml
sitemap https://claw.ru/referatti/sitemap32.xml.gz
sitemap https://claw.ru/referatti/sitemap32.xml
sitemap https://claw.ru/referatti/sitemap33.xml.gz
sitemap https://claw.ru/referatti/sitemap33.xml
sitemap https://claw.ru/referatti/sitemap34.xml.gz
sitemap https://claw.ru/referatti/sitemap34.xml
sitemap https://claw.ru/referatti/sitemap35.xml.gz
sitemap https://claw.ru/referatti/sitemap35.xml
sitemap https://claw.ru/referatti/sitemap36.xml.gz
sitemap https://claw.ru/referatti/sitemap36.xml
sitemap https://claw.ru/referatti/sitemap37.xml.gz
sitemap https://claw.ru/referatti/sitemap37.xml

Warnings

  • `host` is not a known field.