Robots Exclusion Standard data for top10deals.ru
Resource Scan
Scan Details
Site Domain | top10deals.ru |
Base Domain | top10deals.ru |
Scan Status | Ok |
Last Scan | 2024-09-22T16:04:39+00:00 |
Next Scan | 2024-10-22T16:04:39+00:00 |
Last Scan
Scanned | 2024-09-22T16:04:39+00:00 |
URL | https://top10deals.ru/robots.txt |
Domain IPs | 104.21.15.237, 172.67.208.160, 2606:4700:3031::6815:fed, 2606:4700:3037::ac43:d0a0 |
Response IP | 104.21.15.237 |
Found | Yes |
Hash | 1a5ccf78309eb64ea108c6a15a6d42974c154b3b76eff31d686cdd81c7296793 |
SimHash | 720eaa40c333 |
Groups
moget
ichiro
naverbot
yeti
baiduspider
baiduspider-video
baiduspider-image
youdaobot
seokicks-robot
mj12bot
ahrefsbot
solomonobot
rogerbot
blekkobot
sistrix
proximic
turnitinbot
psbot
gigabot
irlbot
twiceler
cazoodlebot
webinject
spbot
grapeshot
shopwiki
twiceler
cazoodlebot
webinject
irlbot
gigabot
turnitinbot
psbot
riddler
lcc
scrapy
semrushbot
semrushbot-sa
ccbot
amazonbot
claudebot
google-extended
gptbot
bingbot
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /statso |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
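
To illustrate how a crawler might evaluate the groups reported above, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_SNIPPET` text is a hypothetical reconstruction, not the actual file: it includes only two of the listed agents, and it assumes the `crawl-delay` record belongs to the `*` group, which the report does not state. `examplebot` is a made-up agent name used to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of part of the scanned file (assumption):
# two of the listed agents share a blanket Disallow, and the crawl-delay
# record is placed in the "*" group for illustration only.
ROBOTS_SNIPPET = """\
User-agent: gptbot
User-agent: bingbot
Disallow: /

User-agent: *
Disallow: /statso
Crawl-delay: 5
"""


def build_parser(text):
    """Parse robots.txt text held in memory instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_SNIPPET)

# An agent named in the first group is refused for every path.
listed_root = parser.can_fetch("gptbot", "https://top10deals.ru/")

# An agent not named anywhere falls through to the "*" group.
other_root = parser.can_fetch("examplebot", "https://top10deals.ru/")
other_statso = parser.can_fetch("examplebot", "https://top10deals.ru/statso")
other_delay = parser.crawl_delay("examplebot")

print(listed_root, other_root, other_statso, other_delay)
```

Under these assumptions, a listed agent is refused everywhere, while an unlisted agent is refused only for paths beginning with `/statso` and is asked to wait 5 seconds between requests.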
Warnings
- `host` is not a known field.
- `request-rate` is not a known field.
- `visit-time` is not a known field.