croc.ru
robots.txt

Robots Exclusion Standard data for croc.ru

Resource Scan

Scan Details

Site Domain croc.ru
Base Domain croc.ru
Scan Status Ok
Last Scan2024-11-09T08:21:11+00:00
Next Scan 2024-12-09T08:21:11+00:00

Last Scan

Scanned2024-11-09T08:21:11+00:00
URL https://croc.ru/robots.txt
Domain IPs 178.248.234.15
Response IP 178.248.234.15
Found Yes
Hash d40a07408d38879c7f0d4e13c2b05d33cf828797ecc64f59a6d4de73a4783951
SimHash a301a811c152

Groups

*

Rule Path
Disallow *?
Disallow *%3D
Disallow *wp-json*
Disallow /cities/*
Allow */?pg=
Allow */?sf_paged=
Disallow */author/
Disallow */posts_industry/
Disallow */post_solutions/
Disallow /coming-soon/
Disallow /en/*
Disallow *.pdf
Disallow /solution_type/
Disallow /solution/*
Disallow /profiles/*
Disallow /cases/*
Allow /industries
Disallow /industries/*/
Disallow /industries/*/*/
Disallow /promo/haas/

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.croc.ru/sitemap_index.xml